In [96]:
# Import all of the things you need to import!
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_extraction.text import TfidfVectorizer

Homework 14 (or so): TF-IDF text analysis and clustering

Hooray, we kind of figured out how text analysis works! Some of it is still magic, but at least the TF and IDF parts make a little sense. Kind of. Somewhat.

No, just kidding, we're professionals now.

Investigating the Congressional Record

The Congressional Record is more or less what happened in Congress every single day. Speeches and all that. A good large source of text data, maybe?

Let's pretend it's totally secret but we just got it leaked to us in a data dump, and we need to check it out. It was leaked from this page here.


In [1]:
# If you'd like to download it through the command line...
!curl -O http://www.cs.cornell.edu/home/llee/data/convote/convote_v1.1.tar.gz


  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 9607k  100 9607k    0     0  8339k      0  0:00:01  0:00:01 --:--:-- 8346k

In [2]:
# And then extract it through the command line...
!tar -zxf convote_v1.1.tar.gz

You can explore the files if you'd like, but we're going to get the ones from convote_v1.1/data_stage_one/development_set/. It's a bunch of text files.


In [3]:
# glob finds files matching a certain filename pattern
import glob

# Give me all the text files
paths = glob.glob('convote_v1.1/data_stage_one/development_set/*')
paths[:5]


Out[3]:
['convote_v1.1/data_stage_one/development_set/052_400095_1479080_ROY.txt',
 'convote_v1.1/data_stage_one/development_set/493_400189_2243032_DON.txt',
 'convote_v1.1/data_stage_one/development_set/052_400011_1479046_DON.txt',
 'convote_v1.1/data_stage_one/development_set/421_400333_2010010_DON.txt',
 'convote_v1.1/data_stage_one/development_set/199_400300_2013031_DON.txt']

In [4]:
len(paths)


Out[4]:
702

So great, we have 702 of them. Now let's import them.


In [6]:
speeches = []
for path in paths:
    with open(path) as speech_file:
        speech = {
            'pathname': path,
            'filename': path.split('/')[-1],
            'content': speech_file.read()
        }
    speeches.append(speech)
speeches_df = pd.DataFrame(speeches)
speeches_df.head()


Out[6]:
content filename pathname
0 mr. chairman , i yield myself such time as i m... 052_400095_1479080_ROY.txt convote_v1.1/data_stage_one/development_set/05...
1 i yield to the gentleman from texas . \n 493_400189_2243032_DON.txt convote_v1.1/data_stage_one/development_set/49...
2 mr. chairman , i do not have it on the top of ... 052_400011_1479046_DON.txt convote_v1.1/data_stage_one/development_set/05...
3 mr. speaker , i yield 3 minutes to the gentlem... 421_400333_2010010_DON.txt convote_v1.1/data_stage_one/development_set/42...
4 mr. speaker , let me conclude on this side by ... 199_400300_2013031_DON.txt convote_v1.1/data_stage_one/development_set/19...

In class we had the texts variable. For the homework can just do speeches_df['content'] to get the same sort of list of stuff.

Take a look at the contents of the first 5 speeches


In [7]:
speeches_df['content'].head(5)


Out[7]:
0    mr. chairman , i yield myself such time as i m...
1             i yield to the gentleman from texas . \n
2    mr. chairman , i do not have it on the top of ...
3    mr. speaker , i yield 3 minutes to the gentlem...
4    mr. speaker , let me conclude on this side by ...
Name: content, dtype: object

Doing our analysis

Use the sklearn package and a plain boring CountVectorizer to get a list of all of the tokens used in the speeches. If it won't list them all, that's ok! Make a dataframe with those terms as columns.

Be sure to include English-language stopwords


In [34]:
count_vectorizer = CountVectorizer(stop_words='english')

In [35]:
X=count_vectorizer.fit_transform(speeches_df['content'])

In [36]:
len(count_vectorizer.get_feature_names())


Out[36]:
9106

In [25]:
tokens_df=pd.DataFrame(X.toarray(), columns=count_vectorizer.get_feature_names())
tokens_df


Out[25]:
000 00007 018 050 092 10 100 106 107 108 ... youngsters youth yuan zero zeroing zeros zigler zirkin zoe zoellick
0 2 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
1 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
2 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
3 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
4 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
5 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
6 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
7 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
8 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
9 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
10 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
11 1 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
12 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
13 1 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
14 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
15 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
16 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
17 0 0 0 0 0 0 1 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
18 0 0 0 0 0 1 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
19 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
20 1 0 0 0 0 1 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
21 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
22 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
23 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
24 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
25 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
26 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
27 0 0 0 0 0 0 0 0 0 0 ... 0 0 2 0 0 0 0 0 0 0
28 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
29 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
672 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
673 0 0 0 0 0 0 1 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
674 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
675 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
676 1 0 0 0 0 1 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
677 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
678 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
679 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
680 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
681 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
682 0 0 0 0 0 0 1 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
683 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
684 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
685 2 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
686 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
687 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
688 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
689 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
690 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
691 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
692 4 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
693 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
694 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
695 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
696 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
697 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
698 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
699 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
700 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
701 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0

702 rows × 9106 columns

Okay, it's far too big to even look at. Let's try to get a list of features from a new CountVectorizer that only takes the top 100 words.


In [40]:
count_vectorizer = CountVectorizer(stop_words='english', max_features=100)

In [41]:
X=count_vectorizer.fit_transform(speeches_df['content'])
len(count_vectorizer.get_feature_names())


Out[41]:
100

Now let's push all of that into a dataframe with nicely named columns.


In [42]:
tokens_df=pd.DataFrame(X.toarray(), columns=count_vectorizer.get_feature_names())
tokens_df


Out[42]:
000 11 act allow amendment america american amp association balance ... trade united urge vote want way work year years yield
0 2 0 0 0 1 0 0 0 0 1 ... 0 0 1 0 0 0 0 0 0 2
1 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
2 0 0 0 1 0 0 1 0 0 0 ... 0 1 0 0 0 0 0 0 0 0
3 0 0 0 0 0 0 0 0 0 0 ... 1 0 0 0 0 0 0 0 0 1
4 0 0 0 0 1 0 2 0 0 0 ... 0 2 1 1 0 0 0 4 0 0
5 0 0 0 0 2 0 0 0 0 0 ... 0 0 2 0 0 0 0 0 0 0
6 0 1 2 0 0 0 0 0 0 0 ... 0 0 1 2 2 2 2 0 0 0
7 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
8 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
9 0 0 2 0 3 0 0 0 0 0 ... 0 0 0 0 1 0 0 0 0 0
10 0 0 0 0 0 0 1 0 0 0 ... 5 0 0 3 1 0 0 0 1 2
11 1 0 0 2 9 0 0 0 0 1 ... 0 0 0 6 1 1 0 2 0 1
12 0 0 0 1 1 0 0 0 0 0 ... 0 0 1 3 0 0 0 0 0 1
13 1 0 3 0 0 1 4 0 7 1 ... 0 1 0 0 1 0 0 1 2 1
14 0 0 2 0 0 0 0 0 0 0 ... 4 2 1 0 0 0 0 1 0 0
15 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
16 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 1 0 0 0 0 0 0
17 0 0 2 0 3 0 2 0 0 0 ... 0 0 1 4 0 1 0 1 0 0
18 0 44 13 1 0 1 3 3 4 1 ... 0 11 4 4 0 0 0 2 2 1
19 0 0 0 1 1 2 0 0 0 0 ... 0 0 0 0 2 0 2 0 1 0
20 1 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 4 0 2 2 0 0
21 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 2 0 0 0 0 0 1
22 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
23 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
24 0 0 1 0 2 0 0 0 0 0 ... 0 0 0 0 0 1 0 1 1 0
25 0 0 1 1 4 0 0 0 0 0 ... 0 0 1 0 0 1 0 1 1 0
26 0 0 1 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 1 0 1 0
27 0 0 2 4 0 1 15 0 0 0 ... 9 5 1 4 0 0 0 0 0 0
28 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
29 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 1 0 0 0 0 0 0
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
672 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
673 0 0 3 0 0 1 1 0 0 1 ... 0 0 1 0 0 1 0 0 0 1
674 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
675 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 0
676 1 0 0 0 2 0 0 0 0 0 ... 0 1 1 0 1 1 0 0 1 2
677 0 0 0 0 0 0 0 0 0 1 ... 0 0 0 0 0 0 0 0 0 0
678 0 0 2 0 0 0 2 0 1 0 ... 14 2 2 0 0 3 2 0 0 0
679 0 0 0 0 0 0 0 0 0 1 ... 0 0 0 0 0 0 0 0 0 0
680 0 0 0 0 2 0 0 0 0 1 ... 0 0 0 1 0 0 0 0 0 1
681 0 0 2 0 3 0 0 0 0 1 ... 0 0 0 0 1 0 3 3 3 1
682 0 0 2 0 0 0 0 0 0 0 ... 6 3 0 0 0 0 0 0 0 0
683 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
684 0 0 0 0 0 0 0 0 0 0 ... 1 0 1 1 0 0 0 0 1 0
685 2 0 0 0 2 0 0 0 0 1 ... 0 0 0 0 0 0 0 0 0 1
686 0 0 0 0 3 0 0 0 0 0 ... 0 1 1 2 0 0 0 3 0 0
687 0 0 1 0 0 0 0 0 0 0 ... 7 1 0 1 0 0 0 0 0 0
688 0 0 0 0 0 0 0 0 0 1 ... 0 0 0 0 0 0 0 0 0 1
689 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
690 0 0 1 0 0 3 2 0 0 0 ... 14 0 0 2 1 0 0 1 2 0
691 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
692 4 0 1 2 3 0 1 0 0 0 ... 0 0 0 3 3 2 0 4 1 0
693 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 2
694 0 0 4 0 0 0 0 0 0 1 ... 0 0 0 0 1 0 0 0 0 1
695 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 1
696 0 0 0 0 0 0 0 0 0 0 ... 0 0 0 1 0 1 0 3 3 0
697 0 0 0 1 4 1 0 0 0 0 ... 0 0 1 1 0 0 0 0 0 0
698 0 0 0 0 1 0 0 0 0 0 ... 0 0 0 0 0 0 0 0 0 2
699 0 0 0 0 1 0 0 0 0 0 ... 0 0 0 0 1 0 0 0 0 0
700 0 6 0 0 0 0 0 0 0 0 ... 0 0 0 0 1 0 0 1 0 1
701 0 1 3 0 0 0 3 0 0 0 ... 0 1 0 0 1 1 0 0 0 1

702 rows × 100 columns

Everyone seems to start their speeches with "mr chairman" - how many speeches are there total, and how many don't mention "chairman" and how many mention neither "mr" nor "chairman"?


In [44]:
# 702 rows means 702 speeches, since each speech is a single string
len(tokens_df)


Out[44]:
702

In [53]:
# if the speech doesnt contain a chairman, the column entry will be 0. so, 250 no-chairmain speeches. granted,
# we have no idea if they stared the speech with chairman or just mentioned him somewhere
len(tokens_df[tokens_df['chairman']==0])


Out[53]:
250

In [66]:
# 76 times no mr or chairman. which means they must call the chairman just 'chairman' a lot. rude!
len(tokens_df[(tokens_df['mr']==0) & (tokens_df['chairman']==0)])


Out[66]:
76

What is the index of the speech which is the most thankful, a.k.a. includes the word 'thank' the most times?


In [80]:
# so speech index 375
tokens_df[tokens_df['thank']==tokens_df['thank'].max()].index


Out[80]:
Int64Index([375], dtype='int64')

In [82]:
# lets look at the speech
speeches_df['content'][375]


Out[82]:
"mr. chairman , i just wanted to remind the house that faith-based organizations can and do sponsor federally funded head start programs . \nany sponsor who will agree not to discriminate in employment , if they can sponsor a program with the discrimination amendment , they can sponsor the program without that amendment if they would agree not to discriminate . \nwhat we are talking about is discrimination . \nsome people want to discriminate against catholics , jews , muslims , african americans . \nwe had this discussion in the 1960s , and the consensus back then was that discrimination in employment was so offensive that we made it illegal . \nthe victim needs to be protected and the weight of the federal government will fall down on the side of the victim . \nthe vote was not unanimous . \nsome people did not like it then ; they do not like it now . \nand we are discussing where should the weight of the government be , with the victim or with somebody trying to discriminate . \nthis is head start . \nwe should not give students of head start the idea that their parents were denied a federally funded job solely because of their religion . \nwe have heard of the supreme court . \nall of the supreme court decisions have said it is okay for a church to discriminate in employment with church money . \nnone have supported discrimination with direct federal funding . \nwe have heard of our forefathers in 1964 . \nwe know that since 1965 it has been illegal , at least until this administration , to discriminate with federal money . \nhead start has been reauthorized for over 40 years with the civil rights protections . \npresident clinton 's name has been invoked . \nwhat is left out is his signing statement where he said that his analysis was that they could not discriminate with the federal money under his analysis . \nthis administration has changed that analysis , but we need to make sure that president clinton 's whole signing statement is included . \nmr. chairman , i submit for printing in the record letters from numerous organizations including the national head start association which oppose the discrimination amendment and ask us to vote `` no '' on the underlying bill if they sabotage civil rights protections . \nseptember 22 , 2005 . \ndear member of congress : i have become aware that an amendment has been offered by rep . \nboustany ( r-la ) to the head start bill on the house floor today that would give faith-based organizations providing head start services the right to discriminate with federal funds against employees who are of different faiths . \nas the state president of the louisiana head start association , i strongly oppose such an amendment . \nit is a sad day when members of congress try to manipulate compassion evoked by the national tragedy in my state of louisiana caused by katrina to pass a civil rights repeal in head start or jeopardize the passage of this law so important to the children of my state and our nation . \ni know , firsthand , that head start is a model for demonstrating that a strong prohibition on religious employment discrimination with federal funds is fully compatible with federal assistance to faith-based charities . \nfaith-based organizations , like the ones i oversee , can and do fully participate in federally funded programs without discriminating in hiring with those same federal funds . \ni see no reason to change the law to allow them to use federal funds to discriminate against our employees . \nmy state 's religiously affiliated providers are more than capable and willing to honor the civil rights requirements of the head start program . \ni am greatly concerned that the provision to remove civil rights protections for employees could have a negative impact on the children and families who participate in these programs . \ntens of thousands of at-risk 3- and 4-year-old children currently in head start could lose their teachers -- who often are the most important adults to whom they have bonded , other than their parents -- not because those teachers are doing a bad job , but because they are the `` wrong '' religion . \nas the state president of the louisiana head start association , i urge you to reject the boustany amendment to allow discrimination in head start . \nsuch a provision is incompatible with the mission of this program . \nsincerely , & lt ; center & gt ; barbara pickney , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; st . \nlandry parish head start program , state president of the louisiana head start association . \n& lt ; /em & gt ; national head start association , alexandria , va , september 19 , 2005 . \ndear chairman boehner and ranking member miller : on behalf of the more than 2.5 million children and families , program staff and volunteers that comprise the head start and early head start community , we are writing to you today to address certain issues regarding the reauthorization of the head start act . \nwe appreciate the bi-partisan spirit that has occurred throughout this crafting of the reauthorization bill . \nh.r. 2123 does not contain the controversial block grant proposal of the 108th congress and maintains the crucial comprehensive services of the head start program performance standards . \nwe applaud a number of measures and improvements incorporated into this bill , such as enhanced homeless outreach ; greater set asides for migrant and seasonal workers and native americans , as well as early head start programs ; and the addition of a `` seamless service '' provision that allows programs to convert head start slots to early head start slots under certain circumstances . \nwhile the recompetition provision is not perfect , we appreciate that its intent is not to recompete all programs , but to recompete only failing programs . \nwe also acknowledge that the teacher requirements are based on national goals and that training and technical assistance is funded at two percent , with 50 percent of that amount going directly to programs . \nwhile we generally are pleased with the overall intent and direction of h.r. 2123 , we do have continuing concerns about certain specific provisions that we hope that can be resolved before the bill is enacted into law . \nthese concerns are discussed in greater detail below . \nrecompetition procedures , which are laid out in detail in section 641 ( c ) ( 1 ) - ( 19 ) include several areas that are problematic . \nwhile we strongly agree that programs that are not providing high quality services should have to recompete for head start funds , we are concerned that the language in this section may force more programs -- regardless of quality -- to undergo recompetition . \nwe believe that there should be a strong message that all programs must be high performing . \nyet , we also believe that programs that are providing high quality services should not be put in the position of recompeting every five years , as this instability makes it difficult for them to recruit and retain the best teachers , to invest in facilities , and to create lasting partnerships with other community agencies . \nwhile we appreciate the efforts to make the recompetition process fair , there remains a very long list of tests that must be met to determine the priority status of programs . \nwe continue to have concerns that some of these tests could be evaluated in an arbitrary manner , throwing programs into a recompete status , regardless of their performance . \nthe head start community does not want to see failing programs continue , but we would like reassurances that the recompetition process will be unbiased and consistent in its application by the bureau . \nto achieve this , we would prefer that there be more limited parameters to determine the need to recompete a grantee , such as programs that have unresolved areas of noncompliance . \nthe entire head start community is committed to raising the bar when it comes to improving quality and enhancing teacher and staff credentials . \nadditionally , educational levels among head start teachers have increased appreciably since the 1998 congressional mandate to increase the proportion of head start teachers with an a.a . \ndegree . \nfifty-seven percent of head start teachers had at least an a.a . \ndegree in 2003 , exceeding a congressional mandate that 50 percent of head start teachers in center-based classrooms attain an a.a . \ndegree or higher by september 2003 . \nmost head start teachers without degrees were working toward them . \nfifty-eight percent of head start teachers without a degree or credential were enrolled in an early childhood education or related degree program , and 18 percent were in child development associate ( cda ) or equivalent training . \na key to head start 's success in meeting the 1998 mandate was that congress also increased funding , which provided scholarships , release time and qualified substitutes , teacher salary increases , and other quality enhancement supports . \nthe 1998 law required that , when funding for the program increased , a certain percentage of new dollars would be dedicated to quality . \nin the following years , funding for the head start program grew and , as a result , funds available for quality activities increased . \nhowever , head start funding has not kept pace with inflation in recent years , so programs no longer have a growing source of funds to help teachers attain degrees . \nadditional funding will be needed to meet a programs must have the resources to help teachers gain their credentials and to pay salaries at a high enough level to recruit and retain teachers with the required degree . \nwithout new money for teacher salaries , increased credentialing for teachers should not be mandatory . \nwhile we appreciate the modifications made in committee markup to the provisions regarding the head start parent policy councils , we strongly believe in the integral and shared responsibilities of board members and parents in head start governing bodies . \nthe high degree of parental involvement in the head start program has provided a role model for early childhood education for 40 years . \nthe head start community is fully committed to restoration of the current level of authority to parent policy councils . \nthe nrs , a pre- and post-test for head start children , is not a valid measurement of program impact and should not be used in this manner . \nbecause head start serves children with very high level needs , using this kind of measure to evaluate programs may well penalize those programs serving the children with the greatest needs . \nfurther , as pointed out in a may 2005 general accountability office report , the nrs was found to be invalid and unreliable . \nthe gao also confirmed that the nrs is not an appropriate evaluation vehicle for children who are english language learners , especially those who speak neither english nor spanish . \nadditionally , we know that the head start bureau is spending more than $ 21 million annually on the nrs , an expenditure that does not even begin to take into consideration the costs of preparing for and administering the test at the program level . \nwe ask the house of representatives to suspend further use of and expenditures for the nrs until the national academy of sciences can make the test scientifically valid . \nh.r. 2123 contains a provision that the head start community believes is punitive and unreasonable to all head start programs . \nthe process and planning that is required of program administrators for a full prism review can not be performed overnight . \nthe head start community has no objection to unannounced site visits when they concern health and safety issues or are following up on prior compliance matters . \nnhsa believes that a minimum of 30 days notice should be required of the head start bureau before full prism reviews . \nhigh quality training is critically important to improving and sustaining head start quality and childhood outcomes . \nh.r. 2123 limits the ability of parents and staff to travel in order to receive specialized training and career development at national conferences . \nthis is an unnecessary provision that will cause confusion for program administrators since the existing grant application process requires justification of all training . \nwhile the head start community strives for sound collaboration with their respective state officials , it is critically important that state officials reciprocate in these collaborative efforts . \nh.r. 2123 does not require input as it should , and as is now required , from state head start officials in the process of selecting staff who will have coordination responsibilities . \nthe head start community believes that state head start associations should have sign-off on candidates for state collaboration officers , as well as continuing involvement in the planning and implementation of state plans . \nfurthermore , there should be clarification regarding states that have existing state advisory councils , namely that they are permitted to modify them to meet the requirements in the bill . \nthe head start community , including a number of programs administered by religious organizations , strongly opposes any effort by this administration to encourage religious discrimination in hiring practices for head start or any federally-funded program . \nfreedom of religion , a cornerstone of this great nation , should be sacrosanct to all of us . \nit is incomprehensible that the u.s. congress would tamper with the ability of its citizens to practice their faith by using the threat of employment discrimination . \nin spite of its positive provisions , if h.r. 2123 contains a religious discrimination amendment , we must reluctantly oppose the bill . \nin closing , we commend the education and workforce committee for their bi-partisan efforts in this head start reauthorization bill and we hope that modifications will be made that will result in improvements to the program . \nsincerely , ministers in action , washington , dc , september 16 , 2005 . \ndear member of congress : as pastors and leaders of predominately african american congregations across the country , we urge you to stand up for the civil rights and religious freedom of all americans , and to maintain the bipartisan direction of the school readiness act ( h.r. 2123 ) by opposing any attempt to repeal longstanding critical civil rights protections on the house floor . \nthis bill maintains provisions designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded head start programs . \nwe have continually supported these provisions because this is consistent with our commitment to protecting the as religious figures we provide leadership grounded by theological interpretations of scripture , and focus on issues of concern to our parishioners and our community . \nwe agree that religious organizations participating in the head start program make an invaluable contribution to the education of thousands of students in minority communities in particular , but do not agree that discriminating against persons based upon their religion is necessary or desirable in order to provide these much needed services . \nwe are optimistic that this bill can gain broad support among religious , civil rights , labor , education , health , and advocacy organizations , but this broad support will end if there is any threat to remove the longstanding critical civil rights protections in head start . \nin particular , we are seriously concerned about a statement released by the committee on education and the workforce on may 5 , 2005 , in which chairman boehner stated that he foresees an amendment on the house floor to rollback longstanding critical civil rights protections . \nin light of this statement , we are asking members to oppose this amendment and not support the head start bill if the anti-discrimination provisions are removed . \nas leaders of our respective congregations we are committed to providing much needed services in our communities and have done so by respecting the rights of all individuals . \ntherefore , we find it particularly insulting to suggest that it is necessary to remove civil rights protections from head start programs in order for this outreach to continue . \nfurthermore , we can not compromise our principles by supporting a program that allows organizations , including religiously-affiliated organizations , to discriminate with federal taxpayers ' dollars . \nwe urge you to maintain the bipartisan direction of the school readiness act ( h.r. 2123 ) and to not support any agreement that allows for an assault on civil rights protections in federally-funded programs , especially a program as critical as head start . \nthis could destroy the mutually supported nature of the head start program in which the education of young children -- especially minority children -- is so dependent upon parental participation and on ongoing , close relationships with head start teachers . \nuplifting our surrounding community does not require the concurrent advancement of government funded discrimination . \nsincerely , reverend timothy mcdonald , anti-defamation league , new york , ny , september 16 , 2005 . \ndear representative : on behalf of the anti-defamation league , we write to urge you to maintain the civil rights protections currently included in the house education and the workforce-approved version of the school readiness act ( h.r. 2123 ) -- and to oppose any efforts to repeal these important provisions . \nallowing religious-based employment discrimination in federally-funded programs is wrong -- and to do it on the historic head start anti-poverty education program is deeply offensive . \nsince 1972 , agencies that receive government funding for head start -- including religious organizations and houses of worship that host head start programs -- have been prohibited from discriminating on the basis of religion when hiring or firing staff within the federally-funded program . \nthese existing non-discrimination requirements have a history of bipartisan support , and were originally signed into law by president richard nixon . \nthe current anti-discrimination language was included in the 1981 head start reauthorization bill , signed into law by president ronald reagan , and has been included in every head start reauthorization since then -- in 1984 , 1986 , 1990 , 1994 , and 1998 . \nfor 33 years , these fundamental non-discrimination protections have worked well , allowing thousands of head start programs in communities throughout the country to flourish while maintaining constitutional and civil rights safeguards against religious tests for employment in federally-funded programs . \nwe have great appreciation for the vital role religious institutions have historically played in addressing many of our nation 's most pressing social needs , as a critical complement to government-funded programs . \nfor decades , government-funded partnerships with religiously-affiliated organizations -- such as catholic charities , jewish community federations , and lutheran social services -- have helped to combat poverty and provided housing , education , and health care services for those in need . \nthese successful partnerships have provided excellent service to communities , largely unburdened by concerns over bureaucratic entanglements between government and religion . \nindeed , at the same time that safeguards have protected beneficiaries from unwanted and unconstitutional the house has never voted to repeal existing civil rights protections in a floor amendment . \nto do so on head start , an historic anti-poverty program universally acclaimed and present in so many communities across the country , is odious . \nwe urge you to oppose any attempt to remove civil rights protections from head start . \nsincerely , & lt ; center & gt ; michael lieberman , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; washington counsel. & lt ; /em & gt ; & lt ; center & gt ; jess n. hordes , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; washington director. & lt ; /em & gt ; american federation of state , county and municipal employees , afl-cio , washington , dc , september 20 , 2005 . \ndear representative : on behalf of the 1.4 million members of the american federation of state , county and municipal employees ( afscme ) , i am writing with respect to certain provisions of h.r. 2123 which would reauthorize the head start program . \nwe want to express our sincere appreciation for the bi-partisan and inclusive process that resulted in unanimous approval of the legislation at the committee level . \nsignificantly , h.r. 2123 does not include the controversial block grant proposal that derailed efforts to reauthorize head start in the last congress . \nrather , h.r. 2123 respects and maintains the crucial comprehensive services of the program performance standards that long have marked head start as a program of distinction . \nwe believe that h.r. 2123 , with some changes , has the very real potential to build upon the success of head start for future generations . \nhowever , we are concerned that this bill does not address the low pay offered to head start teachers and staff and the lack of financial assistance in meeting new and more rigorous educational requirements . \nwe support h.r. 2123 's focus on raising standards for head start teachers , including the provision calling for 50 percent of all current head start teachers to have a bachelor 's degree within five years and all new head start teachers to have an associate 's degree . \nhowever , the estimated cost of the additional education for half of all head start teachers to earn bachelor 's degrees by 2008 is approximately $ 2 billion over five years . \nif we want quality education for head start children , we must be willing to help teachers achieve this important goal . \nafscme members have worked in head start programs for decades . \nwe know that the qualifications of early childhood educators matter because high quality early education improves outcomes for children and delivers benefits to the community that far outweigh the costs . \nwe are also deeply concerned that chairman boehner intends to offer a controversial amendment on the floor to repeal longstanding civil rights protections from the head start program . \nallowing federally-funded discrimination in any program is immoral . \nbut it is especially egregious given that the civil rights protections in head start are an integral part of its mission to provide families a ladder out of poverty by encouraging parents to become volunteers and then teachers . \ndenying a parent economic opportunity because of the religion he/she practices violates the principles upon which our country was founded . \nwe strongly urge you to oppose the amendment . \nif the amendment is adopted , afscme urges you to oppose the bill on final passage . \nsincerely , charles m. loveless , on civil rights , washington , dc , september 16 , 2005 . \ndear representative : on behalf of the leadership conference on civil rights ( lccr ) , the nation 's oldest , largest , and most diverse civil and human rights coalition , with more than 190 member organizations , we urge you to oppose the boehner amendment or any amendment to the school readiness act ( h.r. 2123 ) that would repeal longstanding civil rights protections in the head start program that have been in place since president nixon signed the law in 1972 . \nwe strongly oppose any language that would allow federally-funded employment discrimination . \nif language repealing civil rights protections is added to the bill during consideration on the house floor , we urge you to oppose final passage of h.r. 2123 . \nlccr opposes allowing government-funded employment discrimination . \nreligious organizations have always served as key partners in providing government services through the head start program and current law has not been a hindrance to their vigorous participation . \nthere also is no controversy over the exemption under title vii of the civil rights act of 1964 that allows religious organizations to have a preference of hiring co-religionists when they are using private funds , but federal funds may not be used to discriminate . \nsuch a drastic change to the current head start program would be inconsistent with the long held notion that federal dollars must not be used to discriminate . \nthe boehner amendment would allow government-funded employment discrimination , although the u.s. supreme court affirmed the title vii exemption for privately-funded religious employers , it did not authorize federally-funded employment discrimination . \nsee corporation of presiding bishop of church of jesus christ of latter day saints v. amos , 483 u.s. 327 ( 1987 ) . \nwe believe , based on analysis of amos , that if federal funds are used by religious organizations to hire only persons of their own faith , then the federal government is affirmatively acting to advance employment discrimination . \nin the 60 years since franklin d. roosevelt signed the first executive order prohibiting discrimination in federally funded activity , our nation has made significant progress in the struggle to end employment discrimination and advance equality . \nany attempt to allow organizations to discriminate on the basis of religion with federal funds would drastically impede that progress and erode a longstanding principle of our nation 's civil rights policy : that federal civil rights obligations follow federal dollars , regardless of who receives them . \nthe courts have affirmed the principle that federal funds can not be used to discriminate . \nthe leading case on the question of government-aided discrimination is norwood v. harrison , 413 u.s. 455 ( 1973 ) . \nin a unanimous decision , the u.s. supreme court held that `` the constitution does not permit the state to aid discrimination. '' id . \n465-66 . \nthe principles set out in norwood were affirmed in justice o'connor 's opinion in city of richmond v. j.a . \ncroson co . \n488 u.s. 469 , 492 ( 1989 ) , which stated , lccr urges you to oppose rep . \nboehner 's amendment because current law must not be changed to allow recipients of head start funds to have an explicit statutory right to engage in employment discrimination . \nif this amendment passes , or other language is added during floor consideration that repeals current law , lccr urges you to oppose final passage of h.r. 2123 . \nif you have any questions , please contact nancy zirkin , lccr deputy director , or andrea martin , senior counsel and policy analyst regarding this or any issue important to lccr . \nsincerely , & lt ; center & gt ; wade henderson , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; executive director. & lt ; /em & gt ; & lt ; center & gt ; nancy zirkin , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; deputy director. & lt ; /em & gt ; washington bureau , national association for the advancement of colored people , washington , dc , september 19 , 2005 . \ndear member : on behalf of the national association for the advancement of colored people ( naacp ) , our nation 's oldest , largest and most widely recognized grassroots civil rights organization , i am writing today to urge you to do all you can to ensure that the longstanding , critical civil rights protections that are contained in the current version of h.r. 2123 , the school readiness act , are retained during consideration by the full house of representatives . \nspecifically , i urge you to reject and work against the anticipated boehner amendment , which will repeal existing , long-standing head start provisions that prohibit religious organizations and churches from discriminating on the basis of religion when hiring or firing staff from positions within this federally-funded program . \nh.r. 2132 , as approved by the committee on education and labor , maintains provisions designed to protect the more than 198 , 000 head start teachers , staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded head start programs . \nthe naacp again urges you to do all you can to maintain these vital protections throughout the legislative process , and that you do not support this legislation if , at any point they are stripped . \nthe critical longstanding nondiscrimination provisions have been included in head start legislation since 1981 . \nthis is a fundamental civil rights protection against employment discrimination for head start teachers and volunteers . \nthe legislation has always received strong bipartisan support from both the house and senate since its enactment in the 97th congress when president ronald reagan signed the legislation into law . \nthe twenty-four year old civil rights provision has worked well since the inception of this program , allowing religious organizations to participate in programs while maintaining constitutional and civil rights standards . \nthe naacp both recognizes and celebrates that religious organizations participating in the head start program have made and continue to make an invaluable contribution to the education of thousands of students . \nthese religious organizations have complied with head start 's existing civil rights requirements . \nhowever , if the repeal of the existing civil rights protections were to become law , teachers or parent volunteers working in any head start program run by a religious organization could immediately lose their jobs because of their religion . \nstudents participating in head start therefore could lose not only their teachers , but also the close programmatic connection with their own parents volunteering in the program . \nthe naacp strongly believes that allowing discrimination based on religion thus , i urge you again , in the strongest terms possible , to support the continued inclusion of these longstanding and critical civil rights protections . \nthe head start program is too critical to our children and our nation 's future to allow support for it to be divided by this issue . \nshould you have any questions about the naacp position or if there is any way in which i can be of help to you as you move this reauthorization through the legislative process , i hope that you will feel free to contact me . \nthank you very much for you attention to the views of the naacp . \nsincerely , hilary o. shelton , the american jewish committee , washington , dc , september 19 , 2005 . \ndear representative : on behalf of the american jewish committee , the nation 's oldest human relations organization , with 33 chapters nationwide representing over 150 , 000 members and supporters , i urge you to oppose any amendments to the school readiness act , h.r. 2123 , that roll back crucial civil rights safeguards . \nfurther , if such an amendment is adopted , i urge you to oppose passage of h.r. 2123 ; repealing this longstanding essential element of head start could subject teachers in these federally-funded programs to religious discrimination . \nas passed out of the house education and the workforce committee , the bill maintains three-decade-old provisions that prohibit various forms of employment discrimination in head start . \nboth religious and secular organizations have operated effectively under this system since it passed as part of bipartisan legislation passed during the 9th congress . \never since president richard nixon signed the legislation into law in 1972 , religion-based and other forms of discrimination are prohibited in head start programs , thereby ensuring that taxpayer dollars do not underwrite positions for which religion is a factor in hiring decisions . \nat the same time , the existing provisions do not intrude on the autonomy of religious organizations with respect to hiring decisions made in purely private programs . \nthe efforts of the house education and the workforce committee to produce a bipartisan package are to be commended . \nthe bill that reaches the house floor has the potential to receive broad support among religious , civil rights , labor , education , and health organizations . \nhowever , the bill risks losing critical segments of this support if , at any point , this initiative is amended to roll back head start 's longstanding civil rights protections by exempting religious organizations from the prohibition on religious discrimination in employment decisions . \nif so amended , h.r. 2123 would compromise an extremely successful program that provides essential services to nearly one million at-risk children nationwide . \nwhile many of the religious organizations that deliver the program would , no doubt , continue to hire employees for head start programs without regard to religion , h.r. 2123 could jeopardize the jobs of many thousands of current and potential teachers , staff , and parent volunteers for belonging to the `` wrong '' religion , as well as jeopardize children for whom a stable and trusting relationship between teacher and child is so important . \nfor these reasons , we strongly urge you to oppose any attempts to roll back the vital civil rights protections of h.r. 2123 , the school readiness act . \nthank you for considering our views on this important matter . \nrespectfully , richard t. foltin , of church and state , washington , dc , september 19 , 2005 . \ndear representative : americans united for separation of church and state urges you to oppose any amendment to repeal longstanding , critical civil rights protections contained in the school readiness act ( h.r. 2123 ) and to vote `` no '' on final passage of the bill if such an amendment is adopted . \namericans united represents more than 75 , 000 individual members throughout the fifty states , 9500 clergy nationwide , as well as cooperating houses of worship and other religious bodies committed to the preservation of religious liberty . \nh.r. 2123 unanimously passed out of the committee on education and the workforce on may 18 , 2005 , maintaining a longstanding civil rights provision designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded head start programs . \nwe are pleased with this bipartisan legislation thus far , but are deeply concerned about stated threats to repeal longstanding civil rights protections against religious discrimination in our nation 's head start programs on the house floor . \nspecifically , chairman boehner , after championing the committee-passed bill , stated that an amendment may be offered on the house floor that would repeal these protections . \nwe urge you to reject attempts to sabotage a bipartisan effort to reauthorize the america 's head start programs with such a divisive anti-civil rights amendment . \nwe recognize that religious organizations participating in the head start program make an invaluable contribution to the education of thousands of children . \nthese organizations have complied with head start 's existing civil rights requirements without controversy . \nhowever , if the repeal of the existing civil rights protection were to become law , teachers or parent volunteers working in any head start program run by a religious organization could immediately lose their jobs simply because of their religion or religious beliefs . \nthis would directly work against the stated goals of head start and could change the fundamental character of this tremendously successful program . \naccording to the latest study from the national head start association , the program currently enjoys a soaring 96 percent parental satisfaction rate . \nthe parents and communities that rely on head start programs should not have to choose between the renewal of the head start program and longstanding civil rights protections that are a cornerstone of this invaluable program . \nwe hope that the house will continue the bipartisan goal of reauthorizing our nation 's head start programs and reject any attempts to roll back the civil rights protections long afforded to head start teachers and staff . \nif you have any questions about h.r. 2123 or would like further information on any other issue of importance to americans united , please contact aaron d. schuham , legislative director . \nsincerely , rev . \nbarry w. lynn , for religious liberty , washington , dc , september 16 , 2005 . \ndear representative , the school readiness act of 2005 ( h.r. 2123 ) will soon be considered in the house . \nwe write to urge you to oppose any effort to amend this bipartisan bill in a manner that would repeal current protections against religious discrimination . \nthe current bill , passed out of committee with unanimous approval , maintains these important protections . \nunfortunately , repeated public statements have assured plans for a floor amendment that would allow religious discrimination in federally funded positions . \nwe ask you to oppose any such amendment and to oppose final passage of the bill if the amendment were to pass . \na recent hearing in the subcommittee on criminal justice , drug policy and human resources examining the faith-based initiative demonstrated once again that employment discrimination with federal dollars is one of the initiative 's most controversial and divisive elements . \ntestimony indicated that the continued pursuit of such a rule change is often more about politics than good policy . \nhead start should not be hijacked to promote such an unnecessary and unwise policy . \nreligious organizations and the government have long worked in partnership to perform important social services . \nsuch partnerships are common for head start programs . \nwe support these efforts and recognize the importance of government and religious cooperation generally . \nsuch cooperation has occurred for many years without the danger of government sponsored religious discrimination that is present in the proposed amendment . \nit would be extremely unwise to allow such a dramatic change in policy to threaten the reauthorization of head start . \nwe appreciate your attention to this issue and urge you to oppose any proposal that would allow religious employment discrimination in government funded programs . \nsincerely , k . \nhollyn hollman , american civil liberties union , washington , dc , september 19 , 2005 . \ndear representative : the american civil liberties union strongly urges you to oppose any amendment to repeal longstanding critical civil rights protections contained in the school readiness act ( h.r. 2123 ) and vote `` no '' on final passage if such an amendment is adopted when the bill comes to the floor later this week . \nas unanimously passed out of the committee on education and the workforce , h.r. 2123 maintains longstanding provisions designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded positions in head start programs . \nthe civil rights protections afforded to head start teachers and staff are essential and should not be repealed . \nproposed amendment to h.r. 2123 would repeal longstanding civil rights law that was never controversial we are pleased that the committee-passed head start legislation maintains longstanding critical civil rights protections . \nhowever , we are troubled by the threat of repealing these protections on the house floor . \nin a statement released by the committee on education and the workforce on may 5 , 2005 , the day h.r. 2123 was introduced , chairman boehner stated that he foresaw an amendment on the house floor to roll back longstanding critical civil rights protections . \ncurrent law prohibits participants in head start programs from discriminating based on race , creed [ religion ] , color , national origin , sex , political affiliation or beliefs , or disability . \n42 u.s.c . \n9849 . \nif amended , h.r. 2123 would allow taxpayer dollars to fund religious organizations that discriminate against head start teachers and parent volunteers in federally-funded head start classrooms . \nthe civil rights provision barring federally-funded religious discrimination has never been controversial . \nin fact , the provision was first included in head start legislation that was signed by president richard nixon and subsequently by president ronald reagan . \nthroughout its 33-year history , the civil rights provision has not been an obstacle to the participation of religiously-affiliated organizations in head start programs . \nin fact , many religiously-affiliated organizations participate in head start and comply with the same civil rights provision that applies to everyone else . \nthe proposed amendment to h.r. 2123 would reverse the government 's long fight against federally-funded discrimination repealing critical civil rights protections in head start attacks the very core of civil rights protections historically supported by the federal government . \nmore than 60 years ago , the first success of the modern civil rights movement was a decision by president franklin roosevelt to bar federal contractors from discriminating based on race , religion , or national origin . \nfrom that first presidential decision through the supreme court 's decision allowing the federal government to deny special tax advantages to bob jones university , which claimed a religious right to retain the tax benefits while pursuing racist practices , the federal government has made the eradication of federally-funded discrimination among its highest priorities . \nif amended , h.r. 2123 would allow a religious organization , such as bob jones university , that discriminates based on religion , to participate in federal head start . \nin a disturbing result , bob jones university could be denied tax benefits because of its racist policies toward its students , but could receive federal head start money under h.r. 2123 to discriminate against teachers and parent volunteers working in head start classrooms -- simply because the employees do not meet bob jones university 's religious tests . \nmoreover , in the many religious organizations in which the adherents are all of a single race , the result of federally-funded religious discrimination will effectively be federal funds going to the employment of persons of a single race . \nthe federal government clearly has a compelling interest in applying the head start act 's civil rights provision to everyone receiving federal funds -- including religious organizations seeking to discriminate on the basis of religion in hiring persons to work in head start . \nrepealing critical civil rights protections prohibiting discrimination in employment would be inconsistent with the leading supreme court case on the use of federal funds by religious organizations that discriminate . \nin bob jones univ . \nv. united states , 461 u.s. 574 ( 1983 ) , the supreme court held that federal government could deny a religiously-run university tax benefits because the university imposed a racially discriminatory antimiscegenation policy . \nid . \nat 605 . \nthe court decided that the federal government 's compelling interest in eradicating racial discrimination in education superceded any burden on the university 's religious exercise of enforcing a religiously-motivated ban on students interracial dating . \nid . \nat 604 . \nthere is no meaningful difference between the government prohibiting tax benefits to organizations that discriminate based on race and the head start act 's statutory prohibition on discrimination based on religion in head start classrooms . \nin fact , the united states itself -- during the current administration -- squarely rejected the proposition that intentional religious discrimination gets less protection under the equal protection clause than race . \nin its october 26 , 2001 brief defending the religion prong of title vii from an eleventh amendment attack , the united states stated that `` [ c ] ontrary to defendant 's contention that the supreme court has `distinguished claims involving differential treatment on the basis of race and speech from those involving religion , ' there can be no doubt that the equal protection clause subjects state governments engaging in intentional discrimination on the basis of religion to strict scrutiny. '' brief of intervenor united states in endres v. indiana state police ( n.d . \nind . \noct . \n26 , 2001 ) ( brief is available on www.usdoj.gov ) . \nif critical civil rights protections are repealed , h.r. 2123 would be unconstitutional h.r. 2123 , if amended , would abet unconstitutional employment discrimination based on religion . \nthe proposed amendment 's exemption of religious organizations from the prohibition on religious discrimination in the program is contrary to constitutional law , and will open the door to government-funded discrimination . \nproponents of allowing religious organizations to use federal funds to discriminate against their employees argue that their position is consistent with a provision in title vii of the civil rights act of 1964 that generally permits religious organizations to prefer members of their own religion when making employment decisions . \nhowever , that provision does not consider whether federally-funded religious groups can discriminate with federal taxpayer dollars . \nmoreover , although the supreme court upheld the constitutionality of the religious organization exemption in title vii , corporation of presiding bishop v. amos , 483 u.s. 327 , 336-39 ( 1987 ) , the court has never considered whether it is unconstitutional for a religious organization to discriminate based on religion when making employment several courts have considered whether a religious organization can retain its title vii exemption after receipt of indirect federal funds , e.g. , siegel v. truett-mcconnell college , inc. , 13 f. supp.2d 1335 , 1344 ( n.d . \nga . \n1994 ) ( clarifying that its decision permitting a religious university to invoke the title vii exemption is because the government aid is directed to the students rather than the employer ) , but only one federal court has decided the constitutionality of retaining the title vii exemption after receipt of direct federal funds , dodge v. salvation army , 1989 wl 53857 ( s.d . \nmiss . \n1989 ) . \nin that decision , the court held that the religious employer 's claim of its title vii exemption for a position `` substantially , if not exclusively '' funded with government money in addition to causing the establishment clause violation cited by the court in dodge , h.r. 2210 would also subject the government and any religious employer invoking the right to discriminate with federal dollars to liability for violation of constitutional rights under the free exercise clause and the equal protection clause . \nalthough mere receipt of government funds is insufficient to trigger constitutional obligations on private persons , a close nexus between the government and the private person 's activity can result in the courts treating the private person as a state actor . \nrendell-baker v. kohn , 457 u.s. 830 ( 1982 ) . \nit is beyond question that the government itself can not prefer members of a particular religion to work in a federally-funded program . \nthe equal protection clause subjects governments engaging in intentional discrimination on the basis of religion to strict scrutiny . \ne.g. , united states v. batchelder , 442 u.s. 114 , 125 n.9 ( 1979 ) ; city of new orleans v. dukes , 427 u.s. 297 , 303 ( 1976 ) . \nno government could itself engage in the religious discrimination in employment accommodated and encouraged by the proposed rule 's employment provision . \nthus , the government would be in violation of the free exercise clause and the equal protection clause for knowingly funding religious discrimination . \nof course , a private organization is not subject to the requirements of the free exercise clause and the equal protection clause unless the organization is considered a state actor for a specific purpose . \nwest v. atkins , 487 u.s. 42 , 52 ( 1988 ) . \nthe supreme court recently explained when there is a sufficient nexus between the government and the private person to find that the private person is a state actor for purposes of compliance with constitutional requirements on certain decisions made by participants in the government program : [ s ] tate action may be found if , though only if , there is such a `close nexus between the state and the challenged action ' that seemingly private behavior `may be fairly treated as that of the state itself. ' ... .. \nwe have , for example , held that a challenged activity may be state action when it results from the state 's exercise of `coercive power , ' when the state provides `significant encouragement , either overt or covert , ' or when a private actor operates as a `willful participant in joint activity with the state or its agents ' . \n... .. \nbrentwood academy v. tennessee secondary school athletic association , 121 s. ct . \n924 , ( 2001 ) ( citations omitted ) . \nthe extraordinary role that the current administration -- and the amendment sponsors -- have taken in accommodating , fostering , and encouraging religious organizations to discriminate based on religion when hiring for federally-funded programs creates the nexus for constitutional duties to be imposed on the provider , in addition to the requirements already placed on government itself . \nthe clear intent of this amendment to repeal the civil rights provision in the head start act is to encourage certain providers receiving federal funds to discriminate based on religion . \nthe proposed amendment to h.r. 2123 provision allowing federally-funded religious discrimination is part of a growing pattern of congressional , presidential , and regulatory actions taken specifically for the purpose of accommodating , fostering , and encouraging federally-funded private organizations to discriminate in ways that would unquestionably be unconstitutional if engaged in by the federal government itself . \nfor example , in december of 2002 , president bush signed executive order 13279 , which amended an earlier executive order , which had provided more than 60 years of protection against discrimination based on religion by federal contractors . \nthe bush order provides an exemption for religious organizations contracting with the government to discriminate in employment although religious employers have the right under title vii to apply religious tests to employees , the constitution requires that direct receipt and administration of federal funds removes that exemption . \nin addition , the federal government itself has constitutional obligations to refrain from religious discrimination or from establishing a religion . \nh.r. 2123 , if amended , would fail to meet any of those constitutional mandates . \nfor these reasons , the aclu strongly urges you to vote `` no '' on any proposed amendment to the head start reauthorization ( `` school readiness act '' -- h.r. 2123 ) that would create an unconstitutional loophole allowing federally-funded religious discrimination and to vote `` no '' on final passage if an amendment is adopted . \nthank you for your attention to this matter , and please do not hesitate to call terri schroeder at 202-675-2324 if you have any questions regarding this issue . \nvery truly yours , & lt ; center & gt ; caroline fredrickson , & lt ; /center & gt ; & lt ; center & gt ; director . \n& lt ; center & gt ; terri schroeder , & lt ; /center & gt ; & lt ; center & gt ; senior lobbyist . \nnational league of cities , washington , dc , september 21 , 2005 . \ndear committee member : on behalf of the 18 , 000 cities represented by the national league of cities ( nlc ) , i want to commend members of the education and workforce committee on the passage of bipartisan head start legislation , h.r. 2123 , the `` school readiness act of 2005. '' head start is critical to helping to alleviate the plight of children of the working poor . \nin particular , nlc strongly endorses the committee 's commitment not to include language that would preempt state and local employment laws thereby permitting discrimination in employment by government-funded faith-based social service providers . \nas you know , local governments have a long and rich history of working with faith-based organizations that predates the enactment of the charitable choice provision contained in the welfare-to-work act of 1996 . \nnlc is especially proud of the fact that cities across the nation have carefully helped faith-based groups deliver services to our constituents while respecting the boundaries of our constitution . \npermitting government-funded employment discrimination is the wrong way to encourage faith-based institutions that deliver social services to apply for public funding . \nsimply put , any language that preempts local governments from protecting its residents from employment discrimination undermines the spirit and letter of title vii of the civil rights act and unnecessarily encourages litigation against municipalities . \nnlc asks members of the house of representatives to maintain the committee 's bipartisan direction and oppose any attempts to repeal longstanding anti-discrimination protections during deliberation on the house floor . \nthank you . \nvery truly yours , donald j. borut , national education association , washington , dc . \nseptember 21 , 2005 . \ndear representative : on behalf of the national education association 's ( nea ) 2.7 million members , we would like to offer our views on the school readiness act of 2005 ( h.r. 2123 ) , scheduled for floor debate this week . \noverall , we believe the bill contains a number of positive provisions . \nhowever , we do have some concerns as outlined below . \nin particular , we strongly oppose any amendment to repeal civil rights protections for head start teachers , staff , and volunteers and will oppose the final bill if it does not contain these protections . \nvotes associated with these issues may be included in the nea legislative report card for the 109th congress . \nnea believes that children 's learning begins well before they enter school , and that the transition to school must be founded on strong school readiness . \nhead start has a long history of success in this arena , having provided high-quality early childhood education , health , social services , and parental involvement programs to more than 18.5 million low-income children between the ages of 3 and 5 since its creation in 1964 . \ngiven the critical importance of head start , we are particularly pleased that h.r. 2123 does not allow for block granting of head start funds to states . \nwe are also pleased that the bill would align head start curricula with k-12 education while preserving the comprehensive nature of the head start program . \nwe believe these provisions will support effective transitions for children 's learning and development and ensure that children will enter school ready to learn . \nat the same time , the proposal will provide continuity for children by retaining the essential parental involvement , nutrition , and other nonacademic features of head start . \nwe do have some concerns with portions of h.r. 2123 as drafted as well as proposed amendments : civil rights protections . \nwe are very pleased that h.r. 2123 maintains provisions designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded head start programs . \nwe recognize the invaluable contributions of religious organizations participating in head start . \nhowever , we are deeply concerned that a repeal of civil rights protections could allow religious organizations participating in head start to fire teachers or parent volunteers based on their religion . \nwe strongly believe that allowing discrimination based on religion would significantly impede the important goals of head start as well as send a damaging message to students . \nwe urge your opposition to any amendment , including one expected to be offered by representative boustany , that would repeal civil rights protections for head start employees . \nprofessional development . \nwe are very pleased that h.r. 2123 has a strong focus on early childhood educator professional development . \nwe are concerned , however , that the bill would require teachers to have higher academic degrees , without providing for a substantial increase in funding either for professional development or compensation . \nwe recommend addressing this concern , including by providing grants to help teachers meet the costs of earning their bachelor 's and associates degrees and/or increasing the salaries of those teachers who earn degrees in early childhood education . \nassessments . \nh.r. 2123 allows a study of , and recommendations on , appropriate assessments for young children . \nwe would recommend that the national academy of sciences conduct a review of the national reporting system to ensure that the assessments are comprehensive , reliable , and that the results are used to improve student achievement . \nwe also hope to work with you toward increasing funding authorization levels to ensure that head start can fully serve all eligible low-income children and their families . \nwe thank you for your consideration of our views on these important issues . \n& lt ; center & gt ; diane shust , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; director of government relations . \n& lt ; /em & gt ; & lt ; center & gt ; randall moody , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; manager of federal policy and politics. & lt ; /em & gt ; american humanist association , washington , dc , september 16 , 2005 . \ndear representative : the american humanist association ( aha ) stands in opposition to any retrenchment of existing civil rights protections , and therefore opposes any specific attempt to reverse the nondiscrimination provisions currently in effect in the head start program . \ncongressman john boehner ( oh ) has indicated his intent to roll back vital civil rights protections by introducing , on the house floor , an amendment to h.r. 2123 , the school readiness act . \non behalf of the oldest and largest humanist organization in the nation , i ask you to oppose any such attempt to legalize discrimination with federal funds as you vote on the bipartisan head start reauthorization bill . \nthere is no compelling reason to undo the civil rights protections in the head start program that president nixon signed into law in 1972 . \nif this 33 year old nondiscrimination policy were discarded , the head start reauthorization would permit religious organizations to use federal funds to discriminate on the basis of religion , even when engaging in purely secular early childhood education activities . \nnot only would such a removal of employment discrimination safeguards hold significant potential harm for humanists , jews , muslims , buddhists , and others who hold minority lifestances , it would not address an existing problem . \nfaith-based organizations have been partnering with the government to provide social services for many years without the need to bypass civil rights laws . \nhumanists are particularly concerned about this potential amendment because many dedicated teachers and volunteers in the head start program would find themselves disenfranchised just because they do not happen to believe as others do . \nas a result , this bill will likely lose the existing support of many religious , civil rights , education , health , and advocacy organizations if congressman boehner 's amendment is adopted . \nas humanists we persistently oppose federal funding for discrimination , especially discrimination done on the basis of religion or lack thereof . \nif religious or secular organizations wish to utilize taxpayer dollars to operate on our government 's behalf , they must also abide by the standards set for public service . \nthis is why i write to ask you to oppose any amendment to the legislation that would roll back these critical civil rights protections . \nif such an amendment is added to the bill , we strongly urge you to oppose final passage of the bill . \nshould you have any questions about our position , please do not hesitate to contact roy speckhardt on our staff . \nsincerely , mel lipman , religious discrimination , september 19 , 2005 . \ndear representative : we , the undersigned religious , civil rights , labor , education , health , and advocacy organizations are writing to urge you to oppose any amendment to repeal longstanding critical civil rights protections contained in the school readiness act ( h.r. 2123 ) and vote `` no '' on final passage if such an amendment is adopted . \nas unanimously passed out of the committee on education and the workforce , h.r. 2123 maintains longstanding provisions designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally-funded positions in head start programs . \nthe critical longstanding nondiscrimination provisions have been included in head start legislation since 1972 . \nthis is a fundamental civil rights protection against employment discrimination for head start teachers and volunteers . \nthe legislation always has received strong bipartisan support from both the house and senate since its enactment in the 92nd congress when president nixon signed the legislation into law . \nthe 33 year old civil rights provision has worked effectively since the inception of this program , allowing religious organizations to participate in programs while maintaining constitutional and civil rights standards . \nwe are pleased that the committee-passed head start legislation maintains longstanding critical civil rights protections . \nhowever , we are troubled by the threat of repealing these protections on the house floor . \nin a statement released by the committee on education and the workforce on may 5 , 2005 , the day h.r. 2123 was introduced , chairman boehner stated that he foresaw an amendment on the house floor to roll back longstanding critical civil rights protections . \nthe civil rights protections afforded to head start teachers and staff are vital and should not be dislodged . \nwe recognize that religious organizations participating in the head start program make an invaluable contribution to the education of thousands of students . \nthese religious organizations have complied with head start 's existing civil rights requirements . \nhowever , if the repeal of the existing civil rights protections becomes law , teachers or parent volunteers working in any head start program run by a religious organization could potentially lose their jobs based only on their religion . \nstudents participating in head start therefore could lose not only their teachers , but also the close programmatic connection with their own parents volunteering in the program . \nwe strongly believe that allowing discrimination based on religion would significantly impede the important goals of head start , send a damaging message to head start students , and harm their we urge you to maintain current law and reject any assault on civil rights protections in federally-funded programs , especially a program as critical as head start . \nif these longstanding critical civil rights protections are repealed we urge you to vote `` no '' on final passage of h.r. 2123 . \nthe dismantling of civil rights will destroy the nature of a program in which the education of young children is so dependent on parent participation and on ongoing , close relationships with head start teachers . \nsincerely , african american ministers in action . \namerican association of university women . \namerican civil liberties union . \namerican federation of state , county and municipal employees . \namerican federation of teachers . \namerican humanist association . \namerican jewish committee . \namerican jewish congress . \namerican-arab anti-discrimination committee ( adc ) . \namericans for democratic action . \namericans for religious liberty . \namericans united for separation of church and state . \nbaptist joint committee for religious liberty . \ncentral conference of american rabbis . \nchildren 's defense fund . \nchurch women united . \ncommunications workers of america . \ndisciples justice action network ( disciples of christ ) . \nequal partners in faith . \nfaith action network of people for the american way . \ngay , lesbian and straight education network . \ngeneral board of church and society of the united methodist church . \nhuman rights campaign . \ninternational union , uaw . \nlegal momentum ( formerly now legal defense ) . \nmexican american legal defense and educational fund ( maldef ) . \nnational association of social workers . \nnational center on domestic and sexual violence . \nnational council of jewish women . \nnational council of women 's organizations . \nnational education association . \nnational head start association . \nnational mental health association . \nnational organization of women . \nnational pta . \nnational women 's law center . \nomb watch . \npeople for the american way . \nsecular coalition for america . \nservice employees international union . \nstop family violence . \ntexas faith network . \ntexas freedom network . \nthe interfaith alliance/foundation . \nthe secular coalition for america . \nunion for reform judaism . \nunitarian universalist association of congregations . \nunited church of christ justice & amp ; witness ministries . \nwomen of reform judaism . \nthe interfaith alliance , washington , dc , september 16 , 2005 . \ndear representative : i write to you today as the president of the interfaith alliance , a nonpartisan , national grassroots organization dedicated to promoting the positive and healing role of religion in public life to oppose any amendment to repeal longstanding critical civil rights protections contained in the school readiness act ( h.r. 2123 ) and vote `` no '' on final passage if such an amendment is adopted . \nas unanimously passed out of the committee on education and the workforce , h.r. 2123 maintains longstanding provisions designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination based on religion in federally funded head start programs , as an organization whose membership is comprised of 150 , 000 people of faith and good will spanning 75 faith traditions , i can think of no reason to justify an attempt to roll back these longstanding civil rights and religious liberty protections . \nindeed , in a nation as intentionally and increasingly pluralistic as ours , built-in protections prohibiting religious discrimination in federally-funded programs represent a fundamental commitment towards a society that values the contributions and abilities of people of all faith traditions equally . \nreligious organizations have had a long and proud history in their active participation in head start programs . \nfor years , congregations have made substantial contributions to their communities with the existing workplace protections in place . \nif those in congress who seek to repeal these employment safeguards are successful , thousands of teachers and parent volunteers who have dedicated themselves to this program could find themselves no longer welcome at religiously-affiliated head start programs because they are of a different faith than the sponsoring organization . \nwhile the interfaith alliance is supportive of the right of sectarian organizations to hire based on religious preference for purposes of furthering their institutional ministry , we believe that houses of worship forfeit that right once they accept federal taxpayer dollars to implement social service programs that are intended to serve all . \nfurther , any attempt to politicize the head start program -- a federally sponsored preschool program conceived to meet the needs of disadvantaged children since 1965 -- through a floor amendment to add the highly controversial religious exemption language , is not only unnecessary , but a sad commentary on the state of those political leaders who seek to attach religious exemption language to every social service program that comes before the congress . \nthe interfaith alliance is pleased with the bipartisan direction of the head start legislation however ; this bill will no longer be bipartisan if there is any attempt to roll back longstanding critical civil rights protections . \nthe civil rights protections afforded to head start teachers and staff are vital and should not be dislodged . \nthis bill has gained broad support among religious , civil rights , labor , education , health , and advocacy organizations , but that broad support will end if there is any threat to remove the longstanding critical civil rights protections in head start . \nif you need further information on our position on this matter , please do not hesitate to contact kim baldwin , director of public policy and voter education or preetmohan singh , senior policy analyst , at 202-639-6370 . \nsincerely , & lt ; center & gt ; rev . \ndr . \nc. welton gaddy , & lt ; /center & gt ; & lt ; center & gt ; & lt ; em & gt ; president , the interfaith alliance , pastor of preaching and worship , north minster baptist church ( monroe , la ) . & lt ; /em & gt ; association of congregations , washington , dc , june 1 , 2005 . \ndear member of congress : i am writing on behalf of the over 1 , 050 congregations that make up the unitarian universalist association in regard to h.r. 2123 , the school readiness act of 2005 , the legislation to reauthorize the head start program . \nthe unitarian universalist association would like to express our continued support of this program , as we believe that head start is a successful and necessary program that helps prepare nearly 20 million low-income children for success in kindergarten and later life . \nwe remain pleased with the general direction of the house bill as it comes out of the committee on education and the workforce . \nwe are , however , concerned over proposals by committee leadership to offer a floor amendment to repeal civil rights protections in hiring in head start programs . \nthe uua encourages you to pass a reauthorization bill that is truly bi-partisan in recognizing the successes of the head start program and maintaining the high quality of comprehensive services it provides without repeal of long-standing civil rights protections . \nwe ask that you vote against any amendment on the floor that would repeal civil rights protections . \nif such an amendment is included in the final bill , we ask that you vote no on final passage of h.r. 2123 . \nwe urge you to oppose the repeal of longstanding civil rights protections designed to protect head start teachers , staff , and parent volunteers from employment discrimination based on religion in federally funded head start programs . \nthis provision has worked for 24 years , encouraging religious organizations to participate in head start and make invaluable contributions to children 's education and well-being , while maintaining constitutional and civil rights standards . \nallowing discrimination based on religion would significantly impede the important goals of head start , send a damaging message to head start students , and harm their education by separating students from their own teachers and parent volunteers . \non behalf of the unitarian universalist association of congregations , i thank you for your consideration of our views on head start reauthorization . \nhead start is an exemplary program that has a well-deserved reputation for delivering quality services to millions of our country 's children . \nthis program is an excellent example of how religious organizations such as houses of worship work in partnership with the government without compromising either protections for religious minorities or the integrity of religious organizations . \nwe urge the house to pass a bipartisan bill that will continue the success of head start without eliminating important civil rights provisions by voting no on any proposed amendment eliminating such provisions and voting no on final passage of a bill including such provisions . \nin faith , robert c. keithan , international union , clc , washington , dc , september 20 , 2005 . \ndear representative : on behalf of 1.8 million members of the service employees international union ( seiu ) , working in health care , building services , and federal , state , and local governments , including more than 220 , 000 early education workers throughout the united states , i write to encourage you to take a closer look at several key provisions in the head start reauthorization bill that could impact the quality of head start for children . \nas the school readiness act of 2005 ( h.r. 2123 ) moves to the house floor for a vote this week , we hope that you will use this time as an opportunity to improve the quality of head start programs that serve low-income children nationwide . \nsince its inception in 1965 , the head start program has enrolled more than 22 million children . \nhead start provides an array of comprehensive services to low-income parents and children that they may not otherwise have access to on their own . \nhead start not only prepares children for school by providing a solid foundation in cognitive learning and socialization skills , but also helps make children `` ready to learn '' by providing comprehensive health , dental , and nutritional services critically needed by our at-risk children . \nseiu is committed to ensuring that children who participate in head start acquire the skills that prepare them for healthy , successful lives . \nthis goal will not be realized unless certain steps are taken to improve the head start program . \nthe head start bill passed by the house education and workforce committee contains several provisions that we support including greater set asides for migrant and seasonal workers and native americans , as well as early head start programs . \nhowever , seiu remains concerned about a number of provisions that may erode the quality of head start programs if not modified . \nwe have outlined those concerns below . \nseiu supports continuing education for head start staff ; however , the bill 's requirement for additional training and education for head start staff may not become reality without the quality improvement funding to make the plan attainable . \nwhile seiu supports additional training and education for staff , we believe more funds also need to be provided for that training and education . \nhead start teachers on average make $ 23 , 564 annually . \nfurther , there are no current incentives to retain highly qualified staff in head start programs after attaining degrees . \nadditionally , head start needs sufficient resources to ensure every eligible child can participate and to increase the quality of programs . \ntwo out of five preschool children ( about 800 , 000 ) and 97 percent of infants and toddlers who qualify for early head start can not participate in the program simply because there are not enough resources invested in the program . \nwe support full funding for head start so all eligible children have access to the head start program . \nalso , the bill 's re-competition provisions need improvement . \nseiu is encouraged that the house bill does not require automatic re-competition for every grantee after the end of their grant period . \nhowever , the bill does require re-competition for grantees that have a `` deficiency '' during their grant period -- regardless of whether the deficiency has been resolved or not . \nin addition , the secretary has broad authority in identifying what a `` deficiency '' is , the finding of which would require programs to re-compete their grants . \nsuch uncertainty for all programs -- even those with stellar records of performance -- is counterproductive and would end programs ' ability to do any long-range planning . \nin the event a grantee is unsuccessful in a re-competition , seiu continues to have concerns for existing head start workers who may be displaced by re-competition . \nservices and care-giving relationships for children should not be disrupted . \nmoreover , seiu supports parental involvement in head start programs and encourages members of congress to re-think its plan to diminish the role of policy councils . \npolicy councils offer real parental involvement regarding personnel and budgets . \ndespite the advantages of parental involvement , the house bill changes governance responsibility to the board of directors , with policy councils playing only an advisory or consulting role . \ninstead , congress should recognize that parents provide valuable insight into head start programs and can provide the necessary oversight of head start programs when armed with the proper training . \nseiu supports parental involvement through policy councils . \nfinally , seiu vigorously opposes attempts to include language that would repeal longstanding civil rights protections that prohibit religious-based employment discrimination by head start agencies . \nthe house bill currently maintains a provision designed to protect over 198 , 000 head start teachers and staff and over 1 , 450 , 000 parent volunteers from employment discrimination . \nthis decades old civil rights provision has worked effectively since the inception of this program , allowing religious organizations to participate while maintaining constitutional civil and employment protections . \nthe bill has gained broad support among diverse advocacy organizations , but that support will end if there is a successful effort to remove those protections in head start when the bill goes to the floor . \nseiu asks that you vote against any amendment offered that would roll back critical civil rights protections . \nif such an amendment is included in the final bill , we urge you vote no on final passage of h.r. 2123 . \nseiu remains troubled by the bill as it is currently constructed as outlined in the letter and we will endeavor to improve the legislation when the senate takes up reauthorization . \nagain , should an amendment be offered that allows faith-based organizations to use religious discrimination against teachers , staff and parent volunteers working at head start programs , we urge you to vote no upon final passage of the bill . \nsincerely , anna burger , cdf action council , september 20 , 2005 . \ndear representative : as h.r. 2123 , the school readiness act of 2005 , moves towards a full vote in the house of representatives on thursday , september 22 , the children 's defense fund is pleased to support many of the provisions on which the education and workforce committee has worked so thoughtfully and diligently . \nwe are especially pleased that the committee 's bipartisan bill maintains the integrity of the head start program and the quality performance standards that have helped head start successfully serve over 22 million children since the program began . \nwe are extremely concerned , however , about a religious discrimination amendment that will be offered when the bill comes to the house floor . \nthis unwarranted amendment would repeal the important civil rights protections that currently exist in head start that protect teachers and volunteers working in any head start program run by a religious organization . \nsuch an amendment would significantly hinder the goals of the head start program and the quality of care children receive . \ncdf acknowledges the continuing contribution of faith-based individuals and organizations , which have been the backbone of head start since its inception and have historically embraced serving our most vulnerable children when few others would even consider it . \nthe religious discrimination provision , however , strikes at the very core of civil rights issues that so many of these individuals fought to secure . \nit is imperative that faith-based organizations be subject to the same civil rights laws that all programs who receive federal funding must abide by . \nthe following are concerns raised by the amendment : teachers and staff could be hired based on their religion rather than their qualifications . \ntens of thousands of already at-risk 3- and 4-year-old children could lose their head start teachers , who often are the most important adults , other than their parents , with whom they have established meaningful relationships . \nhead start has been an important source of employment for countless parents , but this provision could result in numerous parents losing their jobs , preventing families of head start children from climbing the ladder out of poverty . \nmany head start volunteers are also parents . \nparent involvement has played a critical role in the success of head start . \nthese volunteers could be let go as well if the provision passes . \nhead start is a critical program for our country 's most vulnerable young children , providing them with valuable tools for future success in life . \nwe are greatly concerned that removing civil rights protections for employees and volunteers would be detrimental to the children and families who benefit from this program . \nwhat message does this send to the head start children when their teachers , staff , and parents are denied opportunities in head start , simply because they do not share the federally-funded employers ' religious beliefs ? \nwhile substantial progress has been made creating a bipartisan bill with many positive provisions , the addition of a religious discrimination amendment would require cdf to oppose h.r. 2123 . \nthank you for your continuing commitment to improving head start and helping it reach more of the vulnerable children and families who benefit from its essential services . \nplease oppose the religious discrimination amendment . \nsincerely yours , washington , dc , september 19 , 2005 . \ndear representative : on behalf of the more than 600 , 000 members of the human rights campaign , we write to express our grave concerns with certain provisions of the school readiness act ( h.r. 2123 ) that we understand may be added as the legislation moves to the floor for a vote . \nwe are particularly concerned with statements made by chairman john boehner ( r-oh ) which indicate that his clear intention is to offer an amendment on the floor adding language to reverse the non-discrimination provisions currently in effect in the head start program . \nwe do not believe it should be legal to discriminate with federal funds . \nwe ask you to oppose any attempt to rollback these civil rights protections , which would undermine the current bipartisan nature of the bill . \nif an amendment is added on the floor which would roll back these civil rights protections , we urge you to oppose final passage of the school readiness act ( h.r. 2123 ) . \nas the nation 's largest gay , lesbian , bisexual and transgender civil rights organization , we oppose using federal funds to discriminate on any basis , including religion , which unfortunately has been used as a proxy for discrimination on the basis of sexual orientation and gender identity . \ntwo prominent cases illustrate this problem : bellmore v. united methodist children 's home and department of human resources of georgia and pedreira v. kentucky baptist homes for children . \nfurther , we are particularly concerned that any provisions that allow federally funded religious discrimination will pre-empt local and state non-discrimination laws that include sexual orientation and gender identity . \nwhile we do not hold a position on the overall legislation , we have serious concerns with a provision that we understand will be offered on the floor that would roll back civil rights protections that have been in place and working effectively since 1972 . \nby abandoning these non-discrimination protections , head start providers would be able to discriminate on the basis of religion in federally funded positions , even when engaging in purely secular early childhood education activities . \nfaith-based organizations have been partnering successfully with the government for a number of years without the need to bypass civil rights laws in their efforts to provide social services . \nwe do not object to faith-based organizations providing education-related services or other social services . \nindeed , we deeply respect the faith community 's vital contribution to care for the most vulnerable among us . \njust as it is important these vital programs continue to provide services , it also remains important that federal funds are not used to discriminate on the basis of religion or sexual orientation or gender identity . \nfor these reasons , we urge you to oppose any amendment to the legislation which would rollback these critical civil rights protections and work to produce a bipartisan bill to reauthorize the head start program . \na vote on an amendment permitting federally funded discrimination will be considered a key vote for the human rights campaign . \nshould you have any questions please do not hesitate to contact angela clements on our staff at ( 202 ) 216-1520 . \nsincerely , & lt ; center & gt ; david m. smith , & lt ; /center & gt ; & lt ; center & gt ; christopher labonte , & lt ; /center & gt ; september 19 , 2005 . \ndear representative : on behalf of the 90 , 000 members and supporters of the national council of jewish women ( ncjw ) , i am writing to ask you to oppose the boehner amendment to h.r. 2123 , the school readiness act of 2005 , and to oppose final passage of the bill if this amendment is adopted . \nncjw has been involved with head start since its inception , and we strongly support the program and h.r. 2123 as passed unanimously by the education and the workforce committee . \nefforts to amend the bill to open the door to religious discrimination would compromise the success of this program . \nncjw believes that taxpayer funds should never be used to subsidize discrimination on any basis . \nsince president nixon signed the head start program into law four decades ago , this acclaimed early childhood education program has included civil rights language protecting head start teachers from employment discrimination . \nthis provision works well , allowing religious organizations to participate in head start while maintaining constitutional and civil rights standards . \nncjw strongly supports the bipartisan effort to reauthorize head start . \nbut the boehner amendment looms as a `` poison pill '' undermining this bipartisanship . \nhouse consideration of h.r. 2123 should focus on meeting the needs of disadvantaged children -- improving policy and providing sufficient funds to extend head start to all eligible children . \nthe boehner amendment is totally unnecessary and interjects a controversial , political issue which has the potential to threaten the bill 's progress . \nthe house of representatives must not roll back critical civil rights protections . \nfor over a century , ncjw has been at the forefront of social change , raising its voice on important issues of public policy . \ninspired by our jewish values , ncjw has been , and continues to be , an advocate for the needs of women , children , and families and a strong supporter of equal rights and protections for everyone . \ni urge you to oppose any amendment allowing employment discrimination and to oppose the underlying bill if such an amendment is included . \nsincerely , phyllis snyder , national council of la raza , washington , dc , september 19 , 2005 . \ndear member of congress : on behalf of the national council of la raza ( nclr ) , the largest national latino civil rights and advocacy organization in the u.s. , i write on an issue of great importance to the hispanic community . \non thursday , the house of representatives is scheduled to vote on legislation to reauthorize the head start program , the `` school readiness act of 2005 '' ( h.r. 2123 ) . \nthis legislation is the result of bipartisan work of the committee on education and the workforce to address much-needed improvements to the program for latino children . \nhowever , nclr is concerned that this bipartisan work will be jeopardized by an amendment that would allow for employment discrimination based on religion in the program . \nnclr has long recognized that head start is a critically important program for ensuring that latino children begin their school careers ready to learn . \nfor these reasons , nclr has pursued a reauthorization agenda focused on ensuring that head start continues to show progress in its effort to eliminate disparities in access and enhance the quality of services for latino and limited-english-proficient ( lep ) children and their families . \nwe are pleased that members from both sides of the aisle supported this agenda and worked to include provisions in h.r. 2123 that significantly improve the program for latinos . \nthese provisions include , but are not limited to , the following : additional resources for migrant and seasonal head start ( mshs ) program expansion , which will allow for thousands of farmworker children to exit the fields and enter the classroom . \nan accountability provision which ensures that head start providers serve new populations in their local communities through enhanced monitoring and evaluations of annual community assessments . \na new requirement that the secretary conduct a study on the status of lep children and their families in head start and early head start programs . \na new requirement that the secretary utilize training and technical assistance funds for activities aimed at assisting head start providers to conduct outreach and improve the quality of services to lep populations , particularly in states with new and rapidly growing lep populations . \na new requirement that all head start parents receive information and services in their home language , when possible . \na new requirement that , in addition to making progress toward acquisition of the english language , leps show progress toward the school readiness indicators outlined in the head start education performance standards . \nin addition , while nclr is pleased with the aforementioned provisions in h.r 2123 , we stand in solidarity with the broader civil rights community in our strong opposition to any amendment that could open the door to employment discrimination based on religion in the head start program . \nforemost , such an amendment is unnecessary for ensuring greater participation from the faith-based sector in the program ; faith-based providers have served as an important partner in head start since the program 's inception . \nmoreover , such an amendment will only serve to deter critical attention and debate away from provisions in the legislation that have garnered strong bipartisan support , such as improvements to the program for latino children . \nwe urge members of congress in closing , nclr affirms its strong support of provisions included in h.r. 2123 which increase access to and improve the quality of head start for latino children . \nwe are certain that these policy changes will go a long way toward ensuring that latino children fully benefit from the program and that head start remains a model for early education into the future . \nsincerely , janet murguia , people for the american way , washington , dc , september 16 , 2005 . \ndear representative : on behalf of the more than 750 , 000 members and supporters of people for the american way , we urge you to maintain the bipartisan direction of h.r. 2123 , the `` school readiness act of 2003 , '' and oppose any attempt to repeal longstanding anti-discrimination protections . \nwe commend you on your bipartisan efforts on head start reauthorization legislation . \nhead start programs not only offer opportunities to thousands of low-income children , they also enrich their communities by providing job opportunities to over a third of the parents whose children have participated in the program . \nas it stands , this bill currently upholds key anti-discrimination provisions that have been part of head start since its inception . \nhowever , in a statement released by the committee on education and the workforce on may 5 , 2005 , chairman boehner stated that he anticipates and supports an amendment on the house floor to rollback longstanding critical civil rights protections . \nthis type of amendment would be a direct attack on bipartisan , anti-discrimination provisions that have been part of head start since its creation in 1981 and can not be tolerated . \npeople for the american way can not support a compromise that does not ensure that the existing civil rights protections in h.r. 2123 are not summarily removed on the house floor . \nproponents of anti-civil rights provisions claim there is a need to exempt religious organizations from anti-discrimination laws in order to protect the religious identity of that organization . \nthis is simply not true . \nfor decades , religious organizations have partnered with the government to provide social services . \nthey have done so by separating their worship and related activities from government-funded social services , and , where necessary , creating a separate non-sectarian 501 ( c ) ( 3 ) organization to provide the services . \nunder this model , religious organizations have provided an invaluable contribution to the education of thousands of head start students and to the communities in which they live . \ncongress should not adopt changes that would alter this beneficial relationship , particularly when there is no evidence that religious organizations are actively seeking the religious exemption in question . \nagain , we are pleased with the bipartisan direction of head start reauthorization legislation . \nhowever , we are concerned with any amendments which would rollback longstanding critical civil rights protections and thereby detrimentally affect head start teachers , students and their parents . \nthe current , delicate balance encouraging the participation of religious organizations and compliance with our constitution should not be disrupted . \nfor these reasons , we urge you to continue efforts to ensure that this legislation remains bipartisan , as well as oppose any attempts to repeal longstanding anti-discrimination provisions in h.r. 2123 . \nsincerely , & lt ; center & gt ; ralph g. neas , & lt ; /center & gt ; & lt ; center & gt ; president . \n& lt ; center & gt ; tanya m. clay , & lt ; /center & gt ; & lt ; center & gt ; deputy director of public policy . \nunion for reform judaism , september 19 , 2005 . \ndear representatives : on behalf of the union for reform judaism , whose 900 congregations across north america encompass 1.5 million reform jews , and the central conference of american rabbis ( ccar ) , whose membership includes over 1800 reform rabbis , i strongly urge you to maintain the bipartisan character of the school readiness act of 2005 ( h.r. 2123 ) by opposing any attempt to repeal longstanding civil rights protections that prohibit faith-based head start centers from discriminating in whom they hire on the basis of religion . \nshould such language be added to the bill , i urge you to vote against final passage . \nwe expect government-funded programs to hire the people who are most qualified , not those whose religious beliefs best match those of an employer . \nthis is especially problematic in relation to head start . \none 's faith does not determine how one reads a book to preschoolers or sings the `` alphabet song. '' to deny children living in poverty the most qualified teacher is nothing short of an attack on head start 's core mission -- preparing children to succeed in school . \nsince its founding , head start has prided itself on the strength of its family involvement component . \nhead start has successfully trained many of its low-income parents to work at head start centers , helping parents rise out of poverty . \nin fact , the family and child experiences survey , prepared in january 2002 for the u.s. department of health and human services , found that over 40 percent of head start staff members had children in their households who were current or former head start participants . \non the day this bill becomes law , faith-based head start programs could fire such staff members because of their religious beliefs . \na head start center could refuse to consider a qualified parent for a job because of the way the parent chooses to worship . \nexperience teaches us that a broad exemption for religious organizations would permit religious groups to use government money to discriminate based on race , sexual orientation , and marital status . \nwe are pleased with the bi-partisan efforts to improve upon previous head start reauthorization attempts . \nhowever , on the day that h.r. 2123 was introduced , representative john boehner ( r-oh ) stated his intention to offer an amendment to roll-back the current civil rights protections within the head start program when the bill is considered by the full house . \nto plainly state such intentions diminishes the much-heralded bipartisan spirit of the bill and undermines the gains made thus far in the mark-up process . \nour tradition includes a story of a teacher whose prayer for rain was answered promptly . \nasked to tell of his special merit , he replied : `` i teach children of the poor as well as of the rich ; i accept no fee from any who can not afford it ; and i have a fishpond to delight the children and to encourage them to do their lessons. '' since 1965 , through its comprehensive services and high quality standards , head start has striven to give millions of children an equal opportunity to succeed in school , nurturing their love of learning and delight in life . \ni urge you to protect such opportunity for our nation 's teachers , parents , and children by opposing any attempt to repeal the civil rights protections in h.r. 2123 . \nrespectfully , dear representative : we , the undersigned religious and religiously affiliated organizations , write to urge you to oppose the planned boehner religious discrimination amendment to the school readiness act ( h.r. 2123 ) , the bill reauthorizing the head start program . \nthe bill approved 48-0 by the house committee on education and the workforce that reaches the house floor is the product of many months of hard work resulting in a strong bipartisan agreement . \nit maintains critical civil rights protections in head start , preventing religious discrimination in federally funded head start positions . \nany attempts to amend the bill and repeal these protections threaten not only the bipartisan spirit of the bill , but the integrity of the head start program itself . \nif the promised boehner amendment passes , we urge you to vote `` no '' to h.r. 2123 . \nwe are disappointed that an otherwise acceptable bill could be jeopardized with such an unwise amendment . \nwe represent a diverse array of religions , covering the political and ideological spectrum . \nwe stand united to oppose this unwarranted attack on a vital civil rights provision that protects over 1.6 million teachers and parent volunteers from having to choose between their religion and their participation in the local head start program . \nthe bipartisan bill that passed unanimously out of the committee on education and the workforce has the potential to garner support from a broad range of groups , including all of the religious groups on this letter , but not if the proposed language is included . \nas religious institutions , we support preserving the autonomy of religious organizations with respect to hiring decisions made in privately funded programs . \nhowever , we also recognize the importance of ensuring that taxpayer dollars do not fund positions connected with the operation of the program itself where candidates may be disqualified because of the religion they practice . \nthe longstanding nondiscrimination provision included in head start legislation since 1972 strikes the appropriate balance between religious autonomy and nondiscrimination . \nfor over three decades , as religious and religiously affiliated organizations , we strive to make the world a better place for the next generation and generations to follow . \nthe head start program is an extremely successful government funded means of achieving this goal , providing opportunities for nearly one million at-risk children each year . \nwe urge you to oppose any effort , such as rep . \nboehner 's planned floor amendment , to change this crucial program by stripping its civil rights protections and allowing providers to discriminate on religious grounds . \nthank you for your consideration of this important matter . \nrespectfully , omb watch , washington , dc , september 16 , 2005 . \ndear representative : omb watch strongly urges you to oppose the any attempt to include `` charitable choice '' provisions in the head start program , which would allow religious organizations to discriminate on the basis of the religion when hiring for federally funded programs . \nreligious organizations play a meaningful role in the delivery of social service programs . \nwe do not question the right of religious organizations to participate in federal programs , nor their ability to avail themselves of an exemption under title vii of the civil rights act of 1964 that allows religious organizations to hire co-religionists with their own money . \nhowever , we do question whether federal dollars should fund discrimination by the very few religious organizations that refuse to follow the same rules that all other organizations participating in federal programs follow . \nalthough religious employers have the right under title vii to apply religious tests to employees , the constitution requires that the direct receipt and administration of federal funds remove that exemption . \nin addition , the federal government has constitutional obligations reinforced by chief justice rehnquist 's majority opinion in bowen v. kendrick , 487 u.s. 589 ( 1988 ) . \nthe court stated that although the constitution does not bar religious organizations from participating in federal programs , it requires ( 1 ) that no one participating in a federal program can `` discriminate on the basis of religion '' and ( 2 ) that all federal programs must be carried out in a `` lawful , secular manner. '' id . \nat 609 , 612 . \nfaith-based and secular grantees face high standards and must be treated equally . \nthe acceptance of federal funds -- taxpayer money -- should require all recipients to practice non-discrimination in hiring as it relates to those funds . \ni urge you to maintain the integrity of religious grantees and prevent government-funded religious discrimination by opposing any attempt to include `` charitable choice '' provisions into the head start program . \nif you have any questions , please contact jennifer lowe at 202-234-8494 . \nthank you for your attention to this matter . \nsincerely , gary bass , "

In [ ]:
# wow that was long. but its the most thankful one, so whatever.

If I'm searching for China and trade, what are the top 3 speeches to read according to the CountVectoriser?


In [94]:
# i sorted by china here, on a lark. lets see if it holds true if i sort by trade
tokens_df[(tokens_df['china']>0) & (tokens_df['trade']>0)].sort_values(by='china', ascending=False)[['china', 'trade']].head(3)


Out[94]:
china trade
294 29 63
27 27 9
267 16 5

In [95]:
tokens_df[(tokens_df['china']>0) & (tokens_df['trade']>0)].sort_values(by='trade', ascending=False)[['china', 'trade']].head(3)


Out[95]:
china trade
294 29 63
136 5 21
45 1 18

In [110]:
# kind of! at any rate, speech 294 seems to be the most china and trade related. lets look at it!
speeches_df['content'][294]


Out[110]:
"mr. speaker , i yield myself such time as i may consume . \nmr. speaker , this is a huge week for the congress , a big week for the house of representatives . \nwe are passing out major postal reform for the first time in years , a highway bill that has been in the making for over 2 congresses now , an energy conference report that has also been in the making for over 2 congresses now ; the opportunity to have at least one and perhaps as many as three appropriations conference reports behind us as we enter the august district work period ; and a central american free trade agreement , as well as a bill that gets tough with china , that finally holds our administration 's feet and the feet of , either party 's feet to the fire , and requires that they monitor and enforce the existing trade agreements that have been enacted by this congress . \nthis bill has been called a smoke screen , it has been called a fig leaf , it has been called a number of demeaning terms . \nbut at the end of the day , this is a real worthwhile enforcement tool that gives members the opportunity to show the folks back home where they are on fair level trade with china . \nthe application of u.s. countervailing duty law on nonmarket economies is not an empty gesture . \na system of comprehensive monitoring of chinese compliance with their trade obligations on intellectual property rights ; market access for our american goods , services , and agriculture ; an accounting of the chinese subsidies ; increased transparency so that we know what the government ownership is , we know what they are subsidizing , we know how much . \nthose are more than fig leaves , mr. speaker . \nit requires reporting by treasury to define the currency manipulation and to analyze the effect of what the chinese did with their new exchange rate mechanism this week . \nthat is not a smoke screen . \na $ 6 million a year increase above the president 's request , up to almost $ 45 million a year for the general counsel and an office of monitoring and compliance . \nthat is not an empty promise . \nthat is a real meaningful resource to improve our ability to track the chinese subsidy and the potential manipulation of the global marketplace that is out of compliance with our trade agreements . \nthe suspension for 3 years of the availability of bonds for new shippers in antidumping cases . \nmeaningful , meaningful reform . \nand funding for the itc and an itc report on the sensitivity of u.s. trade and jobs to the currency policy , something that on a bipartisan basis we have heard a great deal of angst about from members of congress . \nthat is a reflection of what is going on in the countryside that there are genuine fears out there about currency manipulation . \nthis bill gives us an opportunity to get our arms around how extensive that is and what effect the reforms and the step forward the chinese government made this week will have on our economy and our employment base . \nthis is an outstanding bill , mr. speaker . \nwe have debated it now , this is the second day , first on the suspension calendar , admittedly with the belief that it would garner two-thirds support from this chamber in the belief that everyone would share in the need to crack down on chinese abuse of trade agreements , that everyone would agree that we need to put as many tools in the tool kit as possible to enforce and monitor their compliance , to bring about that transparency so that the world community can see what is going on , can see where there are distortions , can see where there is manipulation ; and now it is back today for a straight up-or-down vote . \nyesterday , it got 240 votes . \ntoday , i hope it gets even more . \nyesterday there were 19 democrats who supported it . \nthere were five republicans who opposed it . \nit is a bipartisan effort , bipartisan angst , bipartisan support . \ni urge the members to pass the rule and the underlying bill . \nthe material previously referred to by mr. mcgovern xz4002630 is as follows : previous question for h. res. 387 h.r. 3283 -- united states trade rights enforcement act in the resolution strike `` and ( 2 ) '' and insert the following : `` ( 2 ) the amendment in the nature of a substitute printed in section 2 of this resolution if offered by representative rangel of new york or a designee , which shall be in order without intervention of any point of order or demand for division of the question , shall be considered as read , and shall be separately debatable for 60 minutes equally divided and controlled by the proponent and an opponent ; and ( 3 ) '' at the end of the resolution add the following new section : `` sec . \n2. the amendment by representative rangel referred to in section 1 is as follows : amendment in the nature of a substitute to h.r. 3283 offered by mr. rangel xz4003330 of new york strike all after the enacting clause and insert the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; section 1. short title. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; this act may be cited as the `` fair trade with china act of 2005 '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n2. findings. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; the congress finds as follows : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) the growth of the economy of the people 's republic of china is one of the most important developments of the 21st century. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) the bilateral trade relationship between the united states and china is heavily imbalanced and is undermining the long-term economic health of the united states. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 3 ) the united states trade deficit with china has doubled since 2000 , reaching $ 162 , 000 , 000 , 000 in 2004 , the largest bilateral trade deficit in the world. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 4 ) as a consequence of the trade deficit , the united states has had to borrow massive amounts of money from foreign governments. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 5 ) the united states has accumulated more debt to foreign countries since 2000 than in the first 220 years of the country 's history. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 6 ) china has become a major purchaser of united states treasury bonds , and united states indebtedness to the government of china has grown by more than $ 100 , 000 , 000 , 000 since 2000. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 7 ) the large amounts of united states dollars accumulated by the government of china contribute to china 's acquisitions of united states companies , such as the proposed acquisition of unocal corporation by the china national offshore oil corporation. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 8 ) china continues to violate many of the commitments it made when it joined the world trade organization in 2001. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 9 ) china 's inadequate enforcement of intellectual property rights is resulting in infringement levels of 90 percent or more for nearly all forms of intellectual property , and cost american companies more than $ 2 , 500 , 000 , 000 in lost sales in 2004. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 10 ) china 's industrial policies discriminate against foreign firms and products. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 11 ) the government of china continues to heavily subsidize its manufacturing sector through tax incentives , preferential access to credit and capital , subsidized utilities , and other measures. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 12 ) since 1994 , china has kept its currency pegged at approximately 8.3 renminbi to the united states dollar , which has caused the renminbi to become undervalued against the dollar by as much as 40 percent , harming exports of united states goods and services to china and providing an unfair advantage to chinese exports to the united states. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 13 ) current policies of the united states have failed to advance and protect the interests of american workers , farmers , and businesses in the united states-china trade relationship , failed to address effectively china 's unfair trade practices and market access barriers to goods and services and its poor record at protecting intellectual property rights , and failed to stem or reverse the unsustainable united states trade deficit with china . \n& lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 14 ) it is critical that the united states develop and implement a comprehensive and coherent set of policies to address china 's unfair trading practices and failure to abide by its commitments as a member of the world trade organization. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n3. application of countervailing duties to nonmarket economy countries. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) in general. -- section 701 ( a ) ( 1 ) of the tariff act of 1930 ( 19 u.s.c . \n1671 ( a ) ( 1 ) ) is amended by inserting `` ( including a nonmarket economy country ) '' after `` country '' each place it appears. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) effective date. -- the amendments made by subsection ( a ) apply to petitions filed under section 702 of the tariff act of 1930 on or after the date of the enactment of this act. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( c ) antidumping provisions not affected. -- the amendments made by subsection ( a ) shall not affect the status of a country as a nonmarket economy country for purposes of any matter relating to antidumping duties under the tariff act of 1930. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n4. treatment of currency manipulation. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) definition of unjustifiable acts , policies , and practices. -- section 301 ( d ) ( 4 ) ( b ) of the trade act of 1974 ( 19 u.s.c . \n2411 ( d ) ( 4 ) ( b ) ) is amended to read as follows : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) ( i ) acts , policies , and practices that are unjustifiable include , but are not limited to , any act , policy , or practice described in subparagraph ( a ) which involves currency manipulation , or denies national or most-favored nation treatment or the right of establishment or protection of intellectual property rights. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( ii ) in this subparagraph , the term `currency manipulation ' means the protracted large-scale intervention by an authority to undervalue its currency in the exchange market that prevents effective balance of payments adjustment or gains an unfair competitive advantage over the united states. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) investigation into currency manipulation by the people 's republic of china. -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) investigation , determinations , actions. -- the united states trade representative shall -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) conduct an investigation , under sections 302 and 303 of the trade act of 1974 , of the currency practices of the people 's republic of china ; & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) make the applicable determinations under section 304 of that act pursuant to that investigation ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( c ) implement any action , under section 305 of that act , in accordance with such determinations. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) initiation of investigation. -- the united states trade representative shall initiate the investigation required by paragraph ( 1 ) not later than 90 days after the date of the enactment of this act. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n5. clarification of standard for presidential action on itc finding of market disruption. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) amendments to standard for trade representative 's recommendation to the president. -- section 421 ( h ) ( 2 ) of the trade act of 1974 ( 19 u.s.c . \n2451 ( h ) ( 2 ) ) is amended -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) by striking `` ( 2 ) within '' and inserting `` ( 2 ) ( a ) within '' ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) by adding at the end the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) in making a recommendation to the president under subparagraph ( a ) , the trade representative shall consider the facts found , or conclusions drawn , by the commission as they are reported to the trade representative , and the trade representative may not conduct an additional review or reconsideration of the facts found or conclusions reached by the commission. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( c ) if the commission in its report makes an affirmative finding of market disruption , the trade representative shall apply a presumption in favor of relief to prevent or remedy the market disruption. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( d ) the following factors may not be used as the basis of a recommendation by the trade representative to recommend denying relief under this section : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( i ) the presence or absence ( whether actual or potential ) of third-country imports of the product under investigation. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( ii ) any results of the econometric model known as the commercial policy analysis system ( compas ) or equivalent model. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) amendments to standard for presidential action. -- section 421 ( k ) of the trade act of 1974 ( 19 u.s.c . \n2451 ( k ) ) is amended by adding at the end the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 3 ) the president 's determination shall be based on the facts found , or conclusions drawn , by the commission as they are reported to the trade representative under subsection ( g ) . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 4 ) if the commission in its report makes an affirmative finding of market disruption , the president shall apply a presumption in favor of relief to prevent or remedy the market disruption. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 5 ) any determination by the president under paragraph ( 1 ) that providing import relief is not in the national economic interest of the united states may not be based on the following factors : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( a ) the presence or absence ( whether actual or potential ) of third-country imports of the product under investigation. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) any results of the econometric model known as the commercial policy analysis system ( compas ) or equivalent model. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n6. identification of trade expansion priorities. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) identification of trade expansion priorities. -- section 310 of the trade act of 1974 is amended to read as follows : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; '' sec . \n310 . \nidentification of trade expansion priorities. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( a ) identification. -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 1 ) identification and report. -- within 30 days after the submission in each calendar year of the report required by section 181 ( b ) , the trade representative shall -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( a ) review united states trade expansion priorities ; & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) identify priority foreign country practices , the elimination of which is likely to have the most significant potential to increase united states exports , either directly or through the establishment of a beneficial precedent ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( c ) submit to the committee on finance of the senate and the committee on ways and means of the house of representatives and publish in the federal register a report on the priority foreign country practices so identified. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 2 ) factors. -- in identifying priority foreign country practices under paragraph ( 1 ) , the trade representative shall take into account all relevant factors , including -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( a ) the major barriers and trade distorting practices described in the national trade estimate report required under section 181 ( b ) ; & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) the trade agreements to which a foreign country is a party and its compliance with those agreements ; & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( c ) the medium- and long-term implications of foreign government procurement plans ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( d ) the international competitive position and export potential of united states products and services. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 3 ) contents of report. -- the trade representative may include in the report , if appropriate -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( a ) a description of foreign country practices that may in the future warrant identification as priority foreign country practices ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) a statement about other foreign country practices that were not identified because they are already being addressed by provisions of united states trade law , by existing bilateral trade agreements , or as part of trade negotiations with other countries , and because progress is being made toward the elimination of such practices. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( b ) initiation of consultations. -- by no later than the date that is 21 days after the date on which a report is submitted to the appropriate congressional committees under subsection ( a ) ( 1 ) , the trade representative shall seek consultations with each foreign country identified in the report as engaging in priority foreign country practices for the purpose of reaching a satisfactory resolution of such priority practices. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( c ) initiation of investigation. -- if a satisfactory resolution of priority foreign country practices has not been reached under subsection ( b ) within 90 days after the date on which a report is submitted to the appropriate congressional committees under subsection ( a ) ( 1 ) , the trade representative shall initiate under section 302 ( b ) ( 1 ) an investigation under this chapter with respect to such priority foreign country practices . \n`` ( d ) agreements for the elimination of barriers. -- in the consultations with a foreign country that the trade representative is required to request under section 303 ( a ) with respect to an investigation initiated by reason of subsection ( c ) , the trade representative shall seek to negotiate an agreement that provides for the elimination of the practices that are the subject of the investigation as quickly as possible or , if elimination of the practices is not feasible , an agreement that provides for compensatory trade benefits. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( e ) reports. -- the trade representative shall include in the semiannual report required by section 309 a report on the status of any investigations initiated pursuant to subsection ( c ) and , where appropriate , the extent to which such investigations have led to increased opportunities for the export of products and services of the united states. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) initial report on chinese practices. -- not later than 90 days after the date of the enactment of this act , the united states trade representative shall identify , and report to the congress on , priority foreign trade practices of the people 's republic of china , in accordance with section 310 of the trade act of 1974 , as amended by subsection ( a ) of this section . \n( c ) conforming amendment. -- the item relating to section 310 in the table of contents of the trade act of 1974 is amended to read as follows : & lt ; p & gt ; `` sec..310..identification of trade expansion priorities. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n7. requirement of cash deposits. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; section 751 ( a ) ( 1 ) ( b ) of the tariff act of 1930 ( 19 u.s.c . \n1675 ( a ) ( 2 ) ( b ) ) is amended -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) by striking clause ( iii ) ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) by redesignating clause ( iv ) as clause ( iii ) . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n8. itc investigation . \n( a ) investigation. -- the united states international trade commission shall conduct a study , under section 332 of the tariff act of 1930 ( 19 u.s.c . \n1332 ) , regarding how the people 's republic of china uses government intervention to promote investment , employment , and exports . \nthe study shall comprehensively catalog , and when possible quantify , the practices and policies that central , provincial , and local government bodies in the people 's republic of china use to support and to attempt to influence decisionmaking in china 's manufacturing enterprises and industries . \nchapters of this study shall include , but not be limited to , the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) privatization and private ownership. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) price coordination. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 3 ) targeting of industries . \n& lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 4 ) banking and finance. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 5 ) utility rates . \n& lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 6 ) infrastructure development. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 7 ) taxation. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 8 ) restraints on imports and exports. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 9 ) research and development. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 10 ) worker training and retraining. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 11 ) rationalization and closure of uneconomic enterprises. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) timing of reports on investigation. -- the congress requests that -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) not later than 9 months after the date of the enactment of this act , the international trade commission complete its investigation under subsection ( a ) and submit a report on the investigation to the committee on ways and means of the house of representatives and the committee on finance of the senate ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) not later than 1 year after the report under paragraph ( 1 ) is submitted , and annually thereafter through 2016 , the international trade commission prepare and submit to the committees referred to in paragraph ( 1 ) an update of the report. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n9. amendments relating to international financial policy. & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( a ) bilateral negotiations. -- section 3004 ( b ) of the exchange rates and international economic policy coordination act of 1988 ( 22 u.s.c . \n5304 ( b ) ) is amended in the second sentence by striking `` ( 1 ) have material global account surpluses ; and ( 2 ) '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( b ) definition of manipulation. -- section 3006 of the exchange rates and international economic policy coordination act of 1988 ( 22 u.s.c . \n5306 ) is amended by adding at the end the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 3 ) manipulation of rate of exchange. -- a country shall be considered to be manipulating the rate of exchange between its currency and the united states dollar if there is a protracted large-scale intervention by an authority to undervalue its currency in the exchange market that prevents effective balance of payments adjustment or gains an unfair competitive advantage over the united states. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( c ) report. -- section 3005 ( b ) of the exchange rates and international economic policy coordination act of 1988 ( 22 u.s.c . \n5305 ( b ) ) is amended -- & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 1 ) by striking `` and '' at the end of paragraph ( 7 ) ; & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 2 ) by striking the period at the end of paragraph ( 8 ) and inserting `` ; and '' ; and & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; ( 3 ) by adding at the end the following : & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; `` ( 9 ) a detailed explanation of the test the secretary uses to determine whether or not a country is manipulating the rate of exchange between that country 's currency and the dollar for purposes of preventing effective balance of payments adjustment or gaining an unfair competitive advantage over the united states. '' . & lt ; p & gt ; & amp ; nbsp ; & amp ; nbsp ; & amp ; nbsp ; sec . \n10 . \nwithdrawal of normal trade relations treatment from the people 's republic of china . \nnotwithstanding the provisions of title i of public law 106-286 , title iv of the trade act of 1974 , or any other provision of law , effective on the date of the enactment of this act , normal trade relations treatment shall not apply to the products of the people 's republic of china , and normal trade relations treatment may not thereafter be extended to the products of that country. & lt ; p & gt ; mr. speaker , i yield back the balance of my time , and i move the previous question on the resolution . \n"

In [ ]:
# thats another super long speech, but it does seem to mostly be about trade and china.

Now what if I'm using a TfidfVectorizer?


In [108]:
l2_vectorizer = TfidfVectorizer(stop_words='english', use_idf=True)
X = l2_vectorizer.fit_transform(speeches_df['content'])
tfidf_tokens_df = pd.DataFrame(X.toarray(), columns=l2_vectorizer.get_feature_names())
china_trade_df=pd.DataFrame([tfidf_tokens_df['china'], tfidf_tokens_df['trade'], tfidf_tokens_df['china'] + tfidf_tokens_df['trade']], index=["china", "trade", "china + trade"]).T
china_trade_df[china_trade_df.any(axis=1)].sort_values(by='china + trade', ascending=False).head(3)


Out[108]:
china trade china + trade
636 0.438664 0.470697 0.909362
447 0.516963 0.346696 0.863658
690 0.418276 0.439404 0.857680

In [ ]:
# wow, that comes up with a totally different list of speeches. lets look at speech 636

In [109]:
speeches_df['content'][636]


Out[109]:
"madam speaker , i rise today in opposition to h.r. 3283 , the so-called united states trade rights enforcement act . \nthis bill purports to address china 's lax enforcement of its international trade obligations . \nin fact , this bill does little to address serious trade issues with china , and it is on the house floor for only one reason : to garner votes for cafta later this week . \nthere is no question that congress should do everything in its power to enforce trade rights worldwide . \nhowever , giving lip service to an issue that deserves our careful consideration and strong action is a grave disservice to the american people . \nwhat we should be talking about today is the bush administration 's continued failure to decrease our trade deficits and promote labor rights , environmental standards and public health protections with our trading partners . \nlet 's look at the facts : in 2004 , the u.s. trade deficit with china grew to a record $ 162 billion . \nthis despite the fact that china joined the world trade organization , wto , in 2001 and should be well on its way to reducing trade barriers and opening up their markets to u.s. goods and services . \neven the united states trade representative has said that china 's wto compliance efforts are `` far from complete and have not always been satisfactory. '' given these facts , i support strong trade enforcement against china . \ni am a cosponsor of h.r. 1498 , the chinese currency act , which would allow the administration to impose countervailing duties due to china 's continued currency manipulation . \nthe bill has 110 bipartisan cosponsors and provides real enforcement mechanisms , instead of the studies and redefinitions offered by h.r. 3283 . \nif the leadership were serious about china we would be voting on this meaningful legislation today . \nbut , that is not the case . \nmadam speaker , we have known about trade enforcement issues in china for years . \nbut china legislation magically appears only now that cafta is in trouble . \ni urge my colleagues to vote against this sham bill . \n"

In [ ]:
# that one is very short by comparison and this time, its really only about china and trade

What's the content of the speeches? Here's a way to get them:


In [111]:
# index 0 is the first speech, which was the first one imported.
paths[0]


Out[111]:
'convote_v1.1/data_stage_one/development_set/052_400095_1479080_ROY.txt'

In [112]:
# Pass that into 'cat' using { } which lets you put variables in shell commands
# that way you can pass the path to cat
!cat {paths[0]}


mr. chairman , i yield myself such time as i may consume . 
mr. chairman , i heard my colleague from virginia say the cost is now up to three quarters of a million dollars . 
i do not think we are getting rid of the police officers ; i think we are just moving the five horses . 
their salaries , i think , would be fungible . 
so i do not think you can count that . 
as far as being something we do not need because the park police are already out there with their horses , let me state that the capitol grounds are statutorily defined , and because of that the park police do not have jurisdictions over the capitol grounds , it is my understanding . 
this program has only been in existence and operational since may of 2004 . 
the gao study , as the chairman stated , said that it is hard for them to quantify the benefits of the horse patrol because the performance measures are evolving , he failed to say the rest of it , and that data is still being collected on these measures . 
so we are trying to get rid of something that we have not even given a chance to see if it works . 
we are talking about $ 155 , 000 . 
i am quoting from the gao results that they gave when they appeared before the committee on appropriations . 
the horses right now are housed , i heard my colleague from virginia say earlier , that they were housed 20 miles away . 
that is correct , they are . 
and he says that they have to be under stress whenever they are in traffic . 
well , i am a horsewoman . 
i have seven horses of my own . 
let me tell you , it does not cost me $ 155 , 000 for seven horses . 
we have five horses here , and it certainly does not cost three-quarters of a million dollars , and we do not have to provide health benefits and retirement and the like to the horses . 
i think we are cutting short a program that we have not given a chance . 
i urge my colleagues to support my amendment . 
i think it is a good cause . 
i think the horses do a great job . 
it is great pr for us . 
i see folks going up and talking to our capitol police officers . 
yes , the police officers do have the bicycles , but i would venture to say the guys on the bicycles are not sitting up as high as the guys and gals on top of the horses . 
so if there is a problem , they can not see over the cars ; they can not see through the crowds . 
i am pretty passionate about this whole situation . 
yes , i am . 
i just do not think we have given this program the time it needs to really be evaluated , and i go back to what the gao study says , that it is still evolving . 
i will remind members in the gao study they do not recommend eliminating the mounted horse patrol . 
that is critical . 
they do not recommend eliminating it . 
give it time . 
let us let them have their day . 
mr. chairman , i yield back the balance of my time . 

In [ ]:
# i guess i probably should have read ahead to this part. oh well, dumping the index of the speeches_df 
# was still mostly readable

Now search for something else! Another two terms that might show up. elections and chaos? Whatever you thnik might be interesting.


In [128]:
election_chaos_df=pd.DataFrame([tfidf_tokens_df['election'], tfidf_tokens_df['chaos'], tfidf_tokens_df['election'] + tfidf_tokens_df['chaos']], index=["election", "chaos", "election + chaos"]).T
election_chaos_df[election_chaos_df.any(axis=1)].sort_values(by='chaos', ascending=False).head(10)


Out[128]:
election chaos election + chaos
257 0.051012 0.078108 0.129120
382 0.044475 0.068098 0.112573
701 0.148767 0.045557 0.194324
467 0.065667 0.000000 0.065667
352 0.076376 0.000000 0.076376
424 0.179072 0.000000 0.179072
426 0.114181 0.000000 0.114181
459 0.220375 0.000000 0.220375
469 0.105802 0.000000 0.105802
302 0.032576 0.000000 0.032576

In [129]:
# i did the sort that way because i guess they dont talk about chaos much. lets look at that speech
!cat {paths[257]}


mr. chairman , i yield myself 45 seconds . 
mr. chairman , this is about chaos and confusion . 
there is no definition of how the announcement will go out to the people beyond the beltway . 
a mere extending from 2 days to 5 days to make sure that americans , even in crisis , have due process and democracy and justice is not too much to ask . 
i would indulge and beg my colleagues to realize all this does is simply allow for the people of america in crisis to be represented and to be responded to . 
mr. chairman , i yield 30 seconds to the gentlewoman from california ( ms. millender-mcdonald ) xz4002750 , the ranking member of the committee on house administration . 
ms. millender-mcdonald . 
mr. chairman , i rise in strong support of the jackson-lee amendment . 
a portion of the gentlewoman 's amendment seeks to provide an expedited appeals process to the united states district court for matters arising out of the special election process . 
we have been talking about this 44 , 45 , 49-day deadline for special state elections , and it already places significant constraints on the electoral process and on the citizens represented due to its brevity . 
taking away the right of an appeal to united states district court would excessively curtail the procedural due process rights enjoyed by citizens . 
i support the gentlewoman 's amendment . 
mr. chairman , i yield myself the balance of my time , and thank the gentlewoman for her support . 
again , the idea of this amendment , in the judicial review aspect , one , there is no definitive information about how the information will be disseminated to our states and to citizens in a 2-day period if crisis is occurring , if a terrorist act has occurred . 
my amendment gives an additional 5 days to guarantee that that notice be given . 
in addition , the other aspects of the legislation provides for an expedited time frame . 
it does not in any way cause a sufficient delay that would not allow us to restore this body to its ability to do business on behalf of the american people . 
continuity , tragedy , all equal bipartisanship . 
i would ask my colleagues to look at this amendment and all it does provide , the enhanced due process . 
and i think we would not want the terrorists to believe that because of a terrorist act that we have lost our sense of judgment , the constitution and due process . 
after 9/11 , we went to new york to show that we are not afraid of the terrorists . 
i believe we should show that we are not afraid of them by upholding the constitution and due process on behalf of the american people . 
vote for the jackson-lee amendment . 
i ask my colleagues to vote for this amendment . 
mr. chairman , i yield back the balance of my time . 

In [146]:
# thats pretty weak. i tried to come up with something spicier but i cant tell what years these are from.
# how about this?
clinton_welfare_df=pd.DataFrame([tfidf_tokens_df['clinton'], tfidf_tokens_df['welfare'], tfidf_tokens_df['clinton'] + tfidf_tokens_df['welfare']], index=["clinton", "welfare", "clinton + welfare"]).T
clinton_welfare_df[clinton_welfare_df.any(axis=1)].sort_values(by='clinton + welfare', ascending=False).head(10)


Out[146]:
clinton welfare clinton + welfare
346 0.498457 0.182480 0.680938
214 0.418044 0.229563 0.647607
107 0.314363 0.115085 0.429447
356 0.172899 0.047472 0.220371
67 0.177291 0.000000 0.177291
96 0.129783 0.000000 0.129783
31 0.053955 0.059258 0.113213
612 0.107158 0.000000 0.107158
402 0.076199 0.000000 0.076199
560 0.000000 0.050511 0.050511

In [139]:
!cat {paths[346]}


mr. chairman , in the original welfare reform bill by president clinton , this provision was never in it . 
second , it was unconstitutional , and it was never promulgated by president clinton in the rulemaking . 
he does not support that provision . 
if you want to support something that president clinton believed in , then try fiscal responsibility and start balancing the budget . 
this is not what he believes , and the gentleman from ohio knows that , mr. chairman . 

In [ ]:
# still pretty dull. oh well.

Enough of this garbage, let's cluster

Using a simple counting vectorizer, cluster the documents into eight categories, telling me what the top terms are per category.

Using a term frequency vectorizer, cluster the documents into eight categories, telling me what the top terms are per category.

Using a term frequency inverse document frequency vectorizer, cluster the documents into eight categories, telling me what the top terms are per category.


In [150]:
from sklearn.cluster import KMeans

In [151]:
count_vectorizer = CountVectorizer(stop_words='english')
X=count_vectorizer.fit_transform(speeches_df['content'])
number_of_clusters = 8
km = KMeans(n_clusters=number_of_clusters)
km.fit(X)


Out[151]:
KMeans(copy_x=True, init='k-means++', max_iter=300, n_clusters=8, n_init=10,
    n_jobs=1, precompute_distances='auto', random_state=None, tol=0.0001,
    verbose=0)

In [153]:
print("Top terms per cluster:")
order_centroids = km.cluster_centers_.argsort()[:, ::-1]
terms = count_vectorizer.get_feature_names()
for i in range(number_of_clusters):
    top_ten_words = [terms[ind] for ind in order_centroids[i, :5]]
    print("Cluster {}: {}".format(i, ' '.join(top_ten_words)))


Top terms per cluster:
Cluster 0: mr chairman time amendment gentleman
Cluster 1: head start religious rights civil
Cluster 2: nbsp amp lt gt trade
Cluster 3: association national restaurant contractors chamber
Cluster 4: rule 11 rules federal 420
Cluster 5: mr house people time trade
Cluster 6: start head children program amendment
Cluster 7: environmental justice agency executive epa

In [154]:
tf_vectorizer = TfidfVectorizer(stop_words='english', use_idf=False)
X = tf_vectorizer.fit_transform(speeches_df['content'])
number_of_clusters = 8
km = KMeans(n_clusters=number_of_clusters)
km.fit(X)


Out[154]:
KMeans(copy_x=True, init='k-means++', max_iter=300, n_clusters=8, n_init=10,
    n_jobs=1, precompute_distances='auto', random_state=None, tol=0.0001,
    verbose=0)

In [155]:
print("Top terms per cluster:")
order_centroids = km.cluster_centers_.argsort()[:, ::-1]
terms = tf_vectorizer.get_feature_names()
for i in range(number_of_clusters):
    top_ten_words = [terms[ind] for ind in order_centroids[i, :5]]
    print("Cluster {}: {}".format(i, ' '.join(top_ten_words)))


Top terms per cluster:
Cluster 0: yield gentleman texas illinois wisconsin
Cluster 1: mr chairman yield gentleman minutes
Cluster 2: mr chairman amendment time gentleman
Cluster 3: start head children amendment program
Cluster 4: time mr chairman balance yield
Cluster 5: china trade speaker mr legislation
Cluster 6: mr speaker yield gentleman time
Cluster 7: horses wild mr chairman amendment

In [156]:
tfidf_vectorizer = TfidfVectorizer(stop_words='english', use_idf=True)
X = tfidf_vectorizer.fit_transform(speeches_df['content'])
number_of_clusters = 8
km = KMeans(n_clusters=number_of_clusters)
km.fit(X)


Out[156]:
KMeans(copy_x=True, init='k-means++', max_iter=300, n_clusters=8, n_init=10,
    n_jobs=1, precompute_distances='auto', random_state=None, tol=0.0001,
    verbose=0)

In [157]:
print("Top terms per cluster:")
order_centroids = km.cluster_centers_.argsort()[:, ::-1]
terms = tfidf_vectorizer.get_feature_names()
for i in range(number_of_clusters):
    top_ten_words = [terms[ind] for ind in order_centroids[i, :5]]
    print("Cluster {}: {}".format(i, ' '.join(top_ten_words)))


Top terms per cluster:
Cluster 0: demand recorded vote mr speaker
Cluster 1: yield gentleman mr chairman minutes
Cluster 2: start head children program amendment
Cluster 3: mr amendment chairman time gentleman
Cluster 4: frivolous lawsuits civil religious federal
Cluster 5: claim consent opposition ask unanimous
Cluster 6: china trade speaker madam cafta
Cluster 7: balance time chairman reserve mr

In [ ]:

Which one do you think works the best?


In [ ]:
# well, it seems to be between results including wild horses and those including frivolous lawsuits.
# i prefer wild horses but the tfidf is probably the best representation of the actual document

Harry Potter time

I have a scraped collection of Harry Potter fanfiction at https://github.com/ledeprogram/courses/raw/master/algorithms/data/hp.zip.

I want you to read them in, vectorize them and cluster them. Use this process to find out the two types of Harry Potter fanfiction. What is your hypothesis?


In [164]:
paths = glob.glob('hp/hp/*')
paths[:5]


Out[164]:
['hp/hp/9586935.txt',
 'hp/hp/10608415.txt',
 'hp/hp/10608060.txt',
 'hp/hp/9973627.txt',
 'hp/hp/10602965.txt']

In [165]:
hp_fics = []
for path in paths:
    with open(path) as hp_file:
        hp_fic = {
            'pathname': path,
            'filename': path.split('/')[-1],
            'content': hp_file.read()
        }
    hp_fics.append(hp_fic)
hp_fics_df = pd.DataFrame(hp_fics)
hp_fics_df.head()


Out[165]:
content filename pathname
0 Hello, my name is Malcolm Hargreaves, but most... 9586935.txt hp/hp/9586935.txt
1 I do not own Harry Potter or the Darren Shan s... 10608415.txt hp/hp/10608415.txt
2 This is my entry for the "Three Prompts" compe... 10608060.txt hp/hp/10608060.txt
3 Author's Notes: In my own, happy little world,... 9973627.txt hp/hp/9973627.txt
4 The roommates.Harry, Ron, Seamus, Dean, and ev... 10602965.txt hp/hp/10602965.txt

In [166]:
tfidf_vectorizer = TfidfVectorizer(stop_words='english', use_idf=True)
X = tfidf_vectorizer.fit_transform(hp_fics_df['content'])
number_of_clusters = 2
km = KMeans(n_clusters=number_of_clusters)
km.fit(X)


Out[166]:
KMeans(copy_x=True, init='k-means++', max_iter=300, n_clusters=2, n_init=10,
    n_jobs=1, precompute_distances='auto', random_state=None, tol=0.0001,
    verbose=0)

In [167]:
print("Top terms per cluster:")
order_centroids = km.cluster_centers_.argsort()[:, ::-1]
terms = tfidf_vectorizer.get_feature_names()
for i in range(number_of_clusters):
    top_ten_words = [terms[ind] for ind in order_centroids[i, :5]]
    print("Cluster {}: {}".format(i, ' '.join(top_ten_words)))


Top terms per cluster:
Cluster 0: harry hermione draco said just
Cluster 1: lily james sirius remus said

In [ ]:
# i would say people are either writing about harry and hermione or lily and james
# ive never read harry potter, but i think that means either the harry generation or their parents generation?
# seems legit